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SEQUENCE LISTING 
SEQ ID NO: 1 amino acid sequence comprising GAS 40 

MDLEQTKPNQVKQKIALTSTIALLSA SVG 
KTLSQQKAELTELATALTKTTAEI 

TELHNAQ ADQH S KETAL SEQKAS I S AETTRAQDLVEQVKT S EQNI AKLNAMI SNPDAI TKAAQTANDNTKA 
LSSELEKAKADLENQKAKVKKQLTEEI^ 

LKKLEASGYIGSASYNNYYKEHADQI IAKASPGNQLNQYQDI PADRNRFVDPDNLTPEVQNELAQFAAHMI 
NSVRRQLGLPPVTVTAGSQEFARLLSTSYKKTHGOT^ 

RNDDNMYENIGAFNDVHTVNG I KRG IYDS IKYMLFTDHLHGOTYGHAINFLRVDKHNPNAPVYLGF STSNV 

GSIiNEHFVMFPESNIAimQRFNKTPIKAVGSTra 

AQAKVSQIiQGKIiASTLKQSDSLNIiQVRQIil!^ 

EAIjAEQAAARWAIjVAKKAHLQYIjRDFKIjNPNRL 

IiEATIATTEHQLTLLKTIANEKEYRHLDEDIAT^ 

EASARIiAAENTSLVAEALVGQTSElWASNAIVSKITS 

AGWMLAAVGL.TGFRFRKESK 

SEQ ID NO: 2 polynucleotide sequence encoding for GAS 40 

ATGGACTTAGAACAAACGAAGCCAAACCAAGTTAAGCAGAAAATTGCTTTAACCTCAACAATTGCTTTATT 
GAGTGCCAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTA 
ATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCT^ACTATTGATGCAGTTGAA 
AAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAAT 
CAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAATTTACACTAATA 
CTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAA 
ACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACAAAAAGCTAGCAT 
TTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTGAAAACGTCTGAACAAAATATTGCTAAGC 
TCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCA 
TTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGAC 
TGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAG 
CTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAA 
CTTAAAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCAGA 
TCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAATC 
GCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGATT 
AATAGTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGATT 
ACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTAT 
CAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCATT 
CGAAATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACG 
TGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGCTA 
TTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCC 

GGATCTTTGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATTGCTAACCATCAACGCTTTAATAAGAC 
CCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAG 
CGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATA'TTATGGCA 
GCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGACAGCTTAAATCT 
CCAAGTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAG 
CACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACA 
GAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCT 
AAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATACTAAGCAAGATT 
TGGCTT^AAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAAAGCAGT 
CTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATA 
TCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAAC 
CGCTATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTA 
GAAGCTTC AGC AAGATTAGCTGC TGAAAATAC AAGTCTTGTAGCAGAAGC GCTTGTTGGCC AAACC TC TGA 
AATGGTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCTAAGACATCTT 
ATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCAAAG AGCTCTTAAA 
GCAGGAGTCGTCATGTTGGGAGCTGTCGGCCTCACAGGATTTAGGTTCCGTAAGGAATCTAAGTGA 

SEQ ID NO: 3 amino acid sequence comprising an N terminal leader sequence of GAS 40 

MDLEQTKPNQVKQKIALTSTIALLSA 
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SEQ ID NO: 4 polynucleotide sequence encoding an N terminal leader sequence of GAS 40 

ATGG ACTTAGAAC AAAC GAAGCC AAAC CAAGTTAAGCAGAAAATTGCTTTAACCTCAACAATTGC TTTATT 
GAGTGCC 

SEQ ID NO: 5 amino acid sequence comprising a fragment of GAS 40 with N terminal leader 
sequence removed 

SVGVSHQVKADDRASGETKASlSrrHDDSIjPKPET^ 

LKEQQDNEQKALTSAQE I YTNTLAS S EETLIiAQGAEHQRELTATETEIjHNAQADQHSKETAIjSEQKAS I S A 
ETTRAQDLVEQVKTS EQNI AKLNAMI SNPDAITKAAQTANDNTKAL SSELEKAKADIiENQKAKVKKQIiTEE 
LAAQKAALAEKEAELSRLKSSAPSTQDSIVGNNTM^ 

I AKAS PGNQLNQYQDI PADRNRFVDPDNLTPEVQNELAQFAAHMINSVRRQLGIiP PVTVTAGSQEFARIiLS 

TSYKKTHGNTRPSFVYGQPGVSGHYGVGPHDKTI^ 

YDSIKYMLFTDHLHGNTYGHAIKnF^ 

KAVGSTKDYAQRVGTVSDTI AAI KGKVS SLENRLS AIHQEADIMAAQAKVSQLQGKLASTIjKQSDSLNIjQV 
RQIiNDTKGSLRTELIiAAKAKQAQLEATRDQSLAKLASLK^ 

FKLNPNRIiQVIRERIDNTKQDIiAKTTS SLLNAQEAIiAALQAKQS SLEATI ATTEHQLTLLKTLANEKEYRH 
LDEDIATVPDLQVAPPLTGVKPLS YS KIiyTT^ • 

ASNAIVSKITSSITQPSSKTSYGSGSSTTSNIjISDVDESTQRALKAGVVMLAAVGIjTGFRFRKESK 

SEQ ID NO: 6 polynucleotide sequence encoding a fragment of GAS 40 with N terminal leader 
sequence removed 

AGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATACTCA 
CGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTGAAAAAACTC 
TCAGTCAACAAAAAGGAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAATCAACCAC 
TTAAAAGAGCAGCAAGATAATGAACAAAT^GCTTTAACCTCTGCACAAGAAATTTACACTAATACTCTTGC 
AAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAAACAGAGC 
TTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCA^^ 

GAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAAATATTGCTAAGCTCAATGC 
TATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCATTAAGCT 
CAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGACTGAAGAG 
TTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAGCTCCGTC 
TACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAACTTAAAA 
AATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCAGATCAAATT 
ATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAATCGCTTTGT 
TGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGATTAATAGTG 
TAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGATTACTTAGT 
ACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTATCAGGGCA 
TTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCATTCGAAATG 
ATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACGTGGTATT 
TATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGCTATTAACTT 
TTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTTACCTTGGATTTTCAACCAGCAATGTAGGATCTT 
TGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATTGCTAACCATCAACGCTTTAATAAGACCCCTATA 
AAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAGCGATCAA 
AGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGGCAGCCCAAG 
CTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGACAGCTTAAATCTCCAAGTG 
AGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAGCACAACT 
CGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACAGAAGCCT 
TAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCTAAGGGAC 
TTTAAATTGAATCC TAACCGC CTTC AAGTG ATAC GTGAGC GC ATTGAT AATACT AAGC AAGATTTGGCTAA 

AACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAAAGCAGTCTAGAAG 
CTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATATCGCCAC 
TTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAACCGCTATC 
ATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTAGAAGCTT 
CAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGGCCAAACCTCTGAAATGGTA 
GCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCTAAGACATCTTATGGCTC 
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AGGATCTTCTACAACGAGC^TCT^TTTC 

TCGTCATGTTGGCAGCTGTCGGCCTCACAGGATTTAGGTTCCGTAAGGAATCTAAGTGA 

SEQ ID NO: 7 amino acid sequence comprising a C terminal transmembrane region of GAS 40 

ALKAGWMLAAVGLTGFRFRKESK 

SEQ ID NO: 8 polynucleotide sequence encoding a C terminal transmembrane region of GAS 40 

GCTCTTAAAGCAGGAGTCGTCATGTOGGCAGCTGTCGGCC 
GTGA 

SEQ ID NO: 9 amino acid sequence comprising a fragment of GAS 40 with a C terminal 
transmembrane sequence removed 

MDLEQTKPNQVKQKIALTSTIALLSASVGVSHQVK^ 
KTLSQQKAELTELATALTKTTAEINHLK^ 

TELHNAQADQHSKETALS EQKAS I S AETTRAQDLVEQVKTSEQNIAKLNAMI SNPDAITKAAQTANDNTKA 
LS SELEKAKADLENQKAKVKKQLTEELAAQKAAIiAEKEAELSRIiKSS apstqds IVGNNTMKAPQGYPLtEE 
LKKLEASGYIGSASYNNYYKEHADQI IAKASPGNQLNQYQDI PADRNRFVDPDNLTPE^QNELAQFAAHMI 
NSVRRQLGLPPVTVTAGSQEFARLLSTSYKKTHGNTRPSFVYGQPGVSGHYGVGPHDOT 
RNDDNMYENIGAFNDVHTVNGI KRGI YDS I KYMLFTDHIjHGNTYGHAINFIjRVDKHNPNAPVYIjGFSTSNV 

GSLNEHFVMFPESNIANHQRFNKTPIKAVGSTKDYAQRVGTVSDTIAAIKGKVSSLENi^ 
AQAKVSQLQGKXiASTLKQSDSIiNLQVRQli^^ 

E^LAEQAAARVTALVAKKAHLQYtiRDFKLNPNRLQVI RERIDNTKQDLAKTTS SLLNAQEALAALQAKQS S 

LEATIATTEHQLTLIiKTIiANEKEYRHLDEDIATVPDL^ 

EASARLAAENTSIiVAEALVGQTSEMVASNAIVSKI 

SEQ ID NO: 10 polynucleotide sequence encoding a fragment of GAS 40 with a C terminal 
transmembrane sequence removed 

ATGGACTTAGAACAAACGAAGCCAAACCAAGTTAAGCAGAAAATTGCTTTAACCTCAACAATTGCTTTATT 
GAGTGCCAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTA 
ATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTGAA 
AAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAAT 
C AACC AC TTAAAAG AGC AGC AAGATAATGAAC AAAAAGC TTTAAC C TCTGC AC AAGAAATTTAC AC TAATA 

CTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAA ■ 
ACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACAAAAAGCTAGCAT 
TTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAAATATTGCTAAGC 
TCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCA 
TTAAGCTCAG AATTGGAGAAGGC TAAAGC TGAC TTAGAAAATC AAAAAGCTAAAGTT AAAAAGC AATTG AC 

TGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAG 
CTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAA 
CTTAAAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCAGA 
TCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAATC 
GCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGATT 
AATAGTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGATO 
ACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTAT 
CAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCATT 
CGAAATGATGATAACATGTACG^GAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACG 
TGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGCTA 
TTAAC TTTTTACGTGTAGATAAAC AT AACC C TAATGC GC C TGTTTACC TTGGATTTTC AAC C AGC AATGTA 
GGATCTTTGAATGAAC ACTTTGTAATGTTTC C AGAGTCTAAC ATTGCTAAC C ATC AAC GC TTTAATAAGAC 

CCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAG 
CGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGGCA 
GCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATOAGCAAGCACACTTAAG 

CCAAGTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAG 
CACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACA 
GAAGCCTTAGCAGAGC^GCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCT 
AAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATACTAAGCAAGATT 
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TGGCTAAAACTACCTCATCTTTGTTA 
CTAGAAGCTACTATTGCTACCACAGAACACCAG 

TCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTAC 
CGCTATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGA^CAACTATTA 

GAAGCTTC AGC AAGATTAGCTGC TGAAAATAC AAGTCTTGTAGCAGAAGC GC TTGTTGGC CAAAC C TC TGA 
AATGGTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCTAAGACATCTT 
ATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCAAAGA 

SEQ ID NO: 11 amino acid sequence comprising a transmembrane region of GAS 40 as shown 
in Figures 1 and 2. alkagwmlaavgltg 

SEQ ED NO: 12 amino acid sequence comprising a first coiled-coil region of GAS 40 

ETI QEAKATIDAVEKTLS QQKAELTEIATALT SEETLL 
AQGAEHQRELTATETELHNAQADQHSKETALSEQKAS I SAETTRAQDLVEQVKTSEQNXAKLNAMI SNPDA 
ITKAAQTANDNTKALS SELEKAKADXjENQKAKVXKQLTEELAAQKAAIjAEKEAELSRLKS S A 

SEQ ID NO: 13 amino acid sequence comprising a second coiled-coil region of GAS 40 

RLSAIHQEADIMAAQAKVSQLQGKLASTIjKQSDSL 
AKLASLKAALHQTEAIiAEQAAARVTAIjVA 
QEALAALQAKQS S LEATI ATTEHQLTLLKTtiANEKE 

SEQ ID NO: 14 amino acid sequence comprising a leucine zipper motif within the second 
coiled-coil region of GAS 40. 

QVIRERIDNTKQDLAKTTSSLLNAQEALAAIi 

■ 

SEQ ID NO: 15 amino acid sequence comprising SpA from Streptococcus gordonii Genbank 
reference GI 25990270 

MNKRKEVFGFRKS KVAKTLCGAVLGAAL I AI ADQQVIiADEVTETNSTANVAVTTTGNPATNIiPEAQGEATE 

AAS QSQAQAGS KEGALPVEVSADDLNQAVTDAKAAGVNV 

EEIKKTTEAYKKEVEAHQAETDKINAENKAAEDKYQEDLKAHQAE^ 

AAVQKANEDSQLDYQNKLSAYQAELARVQKANAEAK^^ 

DAAMKQYEADIiAAIKJCAKEDITOAD^ 

KQRNAAAKATYEAALKQ YE ADLAAAKKANED S DADYQAKLAAYQTELARVQKANADAKAAYEKAVEDNKAK 

NAALQAENE E I KQRNAAAKTD YEAKLAOT 

GYLSKPSAQSLVYDSEPNAQLSLTTNGKMLKASAVDEAFSH^ 

ELYGNFGDKAGWTTTVGNlSrTEVKFASVLL^ 

KAWLG\njPDPTLGVFASAYTGQEEKDTSIFIKNEFTFYDENDQPINFD^ 

GTFVKI SGS SVGEKDGKI YATETIjNFKQGQGGSRVJTMYKNSQPGSGWD S SD APNSWYGAGAI SMSGPTNJJV 

TVGAI SATQWP SD PVMAVATGKRPNIWYSIjNGKIRAVNVPKITKEKPTPPVAPTEPQAPTYEVEKPIjE PA 

PVAPTYENEPTPPVKTPDQPEPSKPEEPTYETEKPLEPAPWPTYENEPTPPVKTPDQPEPSKPEEPTYET 

EKPLEPAPVAPTYENEPTPPVKTPDQPEPSKPEEPTYDPLPT^ 

PQINKEIKNEDGVDIDRTLVAKQS3VKFELKT 

TYDEASHTVTFKATDETLATYIQADLTKPVETLH^ 

GKPNDPDNPNNNYIKPTKVNKNKEGI^ 

EEALDWPDLVKVADEKGNQVSGVSVQQYDSLEAAPKKVQDIiLKKAN 
TGTSLVITDPMTVKSEFGKTGGKYENKAYQIDFGNGYAT 

QLYQTFNYRLIGGF I PQNHSEELEDYSFVDDYDQAGDQYTGNYKTFSSLNIiTMKDGSVTKAGTDLTSQTTA 

ETDAANGIVTVRSKEDSLQKISLDSPFQAETYLQMRRIAIGTF^^ 

DKPTPIPTPKPKDPDKPETPKEPKVPSPKVEDPSAPIPVSVGK^ 

GLGQLKRKEDESN 

SEQ ID NO: 16 amino acid sequence comprising Streptococcal surface protein B precursor 
from Streptococcus gordonii Genbank reference GI 25055226 AAC44102.3 
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MQKREVFGFRKSKVAKTLCGAVLGAA^ 

ASQSQAQAGSKDGALPVEVSADDLNKAVTOAKAAGV^^ 

EIKKTTEAYKKEVEAHQAETDKINAENKAAED^ 

AVQKANEDSQIiDYQNKLSAYQAEIiARVQKANAEA 

AAMKQYEADLAAIKKAKEDlTOADYQAKLAAYQAEIiARVQKA^ 

QRNETAKATYEAAVKQYEADLAAVKQAl^TN^^ 

AAL QAENE E I KQRNAAAKTD YEAKLAK YEADLAKYKKD F AA YTAAL AEAE S KKKQDG YL S E P R S Q S LNF K S 

EPNAIRTIDSSVHQYGQQELDALVKSWGI SPTNPDRKKSTAYSYFNAINSNNTYAKLVLEKDKPVDVTYTG 

LKNS SFNGKKI SKVVYTYTLKETGFDDGTKMTMFASSDPTVTAWYNDYFTSTNINV^ 

GGLVNF S SliNRGNGSGAIDKDAIESVRNFNGRY I PI SG S S IKIHENNS AYADS SNAEKSRGARWDTSEWDT 

TSSPNNWYGAIVGEITQSEISFNMASSKSGNIWFAFNSNINAIGVPTKPVAPTAPTQPMYETEKPLEPAPV 

VPTYENEPTPPVKTPDQPEPSKPEEPTYETEKPLEPAPVAPTYENEPTPPVKIPDQPEPSKPEEPTYETEK 

PliEPAPVAPTYENEPTPPVKTPDQPEPSKPEEPTYDPLPTPPLAPTPKQLPTPPWPTVHFHYSSLIiAQPQ 

INKEI KNEDGVDIDRTLVAKQS IGKFELKTEALTAGRPKTTS FVLVDPLPTC 

DEASHTVTFKATDETLATYNADI/TKPVETLHPTW^ 

PNDPDNPNJmYIKPTKVNKTTC^ 

■ ALDVRPDLVKVADEKGNQVS GVS VQQYDSLEAAPKKVQDLLKKANITVKGAFQLFSADNPEEF YKQYVSTG 
TSLVITDPJmTCSEFGKTGGKYENKAYQIDFGN^^ 

YQTFNYRLIGGF I PQNHS EELEDYSFVDDYDQAGDQYTGNYKTFS SLNIjTMKDG S VI KAGTDLT S QTTAET 

DATNGIVTVRFKEDFLQKI SIiDS PFQAETYLQMRRI AIGTFENTYVNTVNKVAYASKTTVRTTTP I PRTPDK 

PTPI PTPKPKDPDKPETPKEPKVPS PKVEDPSAPI PVSVGKEIjTTLPKTGTNDATYMPYLGIiAAIiVGFIjGIj 
GIiAKRKED 

SEQ ID NO: 17 amino acid sequence comprising PspA from Streptococcus pneumoniae 
Genbank reference GI 282335 

MNKKKMI IiTSLAS VAILGAGFVASQPTVVRAEES PVASQS KAEKDYDAAKKDAKNAKKAVEDAQKAIiDDAK 
AAQKKYDEDQKKTEEKAALEKAASEEMDKAVAAVQQAYLAYQQATDKAAKDAADKMIDEAKKREEE 
NTVRAMVVTEPEQLAETKKKSEEAKQKAPELTKKLEEAKAKLEEAEKK^ 
NQVHRLEQELKEIDESESEDYAKEGFRAPLQSKLDAKKAKLSKIjEELSDKIDELDAEIAKL 
NWEDYFKEGLEKTIAAKKAEIiEKTEADLKKAVNEPEKPAPAPETPAPEAPAEQPKPAP 
PAEQPKPEKTDDQQAEEDYARRS EEEYNIUjTQQQPPKAEKPAPAPKTGV^QENGMWYF yntdg smatgwlq 
NNG S WYYLNSNGAMATGWL QYNG S WYYLNANGAMATGWAKVNG SWYYLNANGAMATGWLiQYNGS WYYLNAN 
GAMATGWAKVNGSWYYLNANGAMATGV^ 

DTWYYLEASGAMKAS QWFKVSDKWYYVNGLGALAWTTVDGYKVNANGEWV 

SEQ ID NO: 18 amino acid sequence comprising a portion of Se89.9 of Streptococcus equi 
Genbank reference GI 2330384 

ESDIVDATRFSTTEIPKSGQVIDRSASIQALTiroi^ 

QLEAAQAKSHKLDQQVEQLANTKD SLRTQLLAAKEEQAQLKANLDKAL ALLAS SKATIiHKLEAAMEEAKA 
RVAGLAS QKAQLEDLLAFEKNPNRI ELAQEKVAAAKKALADTEDKLLAAQASIjSDLQAQRARL 

SEQ ID NO: 19 polynucleotide sequence comprising GST-40-HIS 

CTGGTTCCGCGTGGATCCC^TATGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGG 
AGAAACGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAA 
CTATTGATGCAGTTGAAAAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACA 
AAAACTACTGCTGAAATCAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACA 
AGAAATTTACACTAATACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCA7UVGAG 
AGTTAACAGCTACTGAAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACT'GCATTGTCA 
G7UVCAAAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGA 
ACAAAATATTGCTAAGCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTA 
ATGATAATAC AAAAGC ATTAAGC TCAGAATTGGAGAAGGC TAAAGC TGACTT AGAAAATC AAAAAGCTAAA 

GTTAAAAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGGTGCTCTAGCAGAAAAAGAGGCAGAACTTAG 
TCGTCTTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAG 
GCTATCCTCTTGAAGAACTTAAAAAATTAGAAGCTAGTGGTTATATTG 

TACAAAGAGGATGCAGATCAAATTATTGCGAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATAT 
TCCAGCAGATCGTAATCGCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGT 
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TTGCAGCTGAGATGATTAATAGTGTAAGAAGACA 
CAAGAATTTGCAAGATTACTTAGTACCAGCTATAAGAAA^ 

CGGAC AGC CAGGGGTATC^GGGCATTATGGTGTOGGGC CTCATGATAAAACTATTAOT 

GAGCGTCAGGGCTCATTCGAAATGATGATAACATGTACGAGAATATCGGTGCTTTO 

GTGAATGGTATTAAACGTGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTO 

TACATACGGCCATGCTATTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTTACC 

TTTCAACCAGCAATGTAGGATCTTTGAATGAACACTO 

C^CGCTTTAATAAGACCCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAG 
ATCTGATACTATTGCAGCGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCT^ 
AAGCTGATATTATGGCAGCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAG 
TC AGAC AGC TTAAATC TC C AAGTGAG ACAATTAAATG ATAC TAAAGGTTC TTTGAGAAC AGAATTAC T AGC 

AGCTAAAGCAAAACAAGCAC^^CTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAG 
CCGCACTGCACCAGACAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAA 
GCTCATTTGCAATATCTAAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGA 
TAATACTAAGCAAGATTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCC.TTAC 
AAGCTAAACAAAGCAGTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTA 
GCTAACGAAAAGGAATATCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACC 
TCTTACGGGCGTAAAACCGCTATCATATAGTAAGATAGATACTACT^ 

AAACGAAACAACTATTAGAAGCTTP^GCAAGATTAGCTGCTGAAAATACAAGTCTOGTAGCAGAAGC 

GTTGGCCAAACCTCTGAAATGGTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCC 

CTCATCTAAGACATCTTATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTA 

CTCAAAGAGCTCTTAAAGCAGGAGTCGTCATGTTGGCAGCTGTCGGCCTCACAGGATTTAGGTTCCGTAAG 
GAATCTAAGGCGGCCGCACTCGAGCACCACCA.CCACCACCACCAC 



SEQ ID NO: 20 amino acid sequence comprising GST-40-HIS 
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SEQ ID NO: 21 polynucleotide sequence comprising 40a-HIS 

ATGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATAC 
TCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTGAAAAAA 
CTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAATCAAC 
CACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAATTTACACTAATACTCT 
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TGCAAGTAGTGAGGAGACGCTATTAGCCC AA.GGAGCCGAAC ATCAAAGAGAGTTAAC AGC TACTGAAAC AG 
AGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGC^ 

GC AGAAAC TACTCG AGCTCAAGATTTAGTGGAAC AAGTC AAAACGTC TGAAC AAAATATTGCTAAGCTCAA 
TGCTATGATTAGCAATCCTGATGCTATCACTAAAGGAGCTCAAACGGCTAATC 

G C TC AGAATTGGAGAAGGC T AAAGCTGAC TTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGACTGAA 
GAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAGCTCC 
GTCTACTCAAGATAG<^TTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAACTTA 
AAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTAC71AAGAGCATGCAGATCAA 
ATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAATCGCTT 
TGTTGATC C C GATAATTTGAC ACCAGAAGTGC AAAATGAGCTAGC GC AGTTTGC AGCTCAC ATGATTAAT A 
GTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGATTACTT 
AGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTATCAGG 
GCATTATGGTGTTGGGCCTCATGATAAAACTATTAOT 

ATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACGTGGT 
ATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTAGACGGAAATACATACGGCCATGCTATTAA 
CTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTTACCTTGGATTTTCAACCAGCAATGTAGGAT 
CTTTGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATTGCTA 

ATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAGCGAT 

CAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGGCAGCCC 

AAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCAC^CTTAAGCAGTCAGACAGCTTAAATCTCCAA 

GTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAGCACA 

ACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACAGAAG 

CCTTAGCAGAGC^GCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCTAAGG 

GACTTTAAATTGAATCCTAACCGCCTTC^GTGATACGTGAGCGCATTGATAATACTAAGCAAGATOT 

TAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAAAGCAGTCTAG ■ 

AAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATATCGC 

CACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAACCGCT 

ATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTAGAAG 

CTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGGCCAAACCTCTGAAATG 

GTAGGAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCTAAGACATCTTATGG 

CTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCA AcQt:< ^GGCCGCACT?CG 

AGCACCACCACCACCACCACCAC 



SEQ ID NO: 22 amino acid sequence comprising 40a-HIS 
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TSSITQPSSKTSYGSGSSTTSNLISDVDESTQRAAA 
XiEHHBHHHH 

SEQ ID NO: 23 polynucleotide sequence comprising 40a-RR-HIS 

ATGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATA^ 
• TCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCA^ 

CTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAATCAAC 

CACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAATTTACACTAATACTC 

TGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAAACAG 

AGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCAT 

GCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTC^ 

TGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCATTAA 
GCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGACTGAA 
GAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAGCTCC 
GTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAACTTA 
AAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCAGATCAA 
ATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAATC 

TGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGATTAATA 
GTGT AcGtcGt CAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTG 

AGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGK3GGTATCAGG 
GCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCATTCGAA 
ATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACGTGGT 

ATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGCTATTAA 
CTTTTTACGTGTAGATAAACATAACCCT^ 

CTTTGAATGAACACTTTGT AATGTTTCC AGAGTCTAAC ATTGC TAAC C ATC AAC GCTTTAATAAGAC C CC T 

ATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAGCGAT 

CAAAGGAA2\AGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGGCAGCCC 

AAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGACAGCTTAAATCTCCAA 

GTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAGCACA 

ACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACAGAAG 

CCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCTAAGG 

GACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATACTAAGCAAG 

TAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTVrAGCAGC^ 

AAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATATCGC 
CACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAACCGCT 
ATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTAGAAG 
CTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGGCCAAACCTCTGAAATG 
GTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCTAAGACATCTTATGG 
CTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCAAcGtGCGGCCGCACTCG 
AGCACCACCACCACCACCACCAC 



SEQ ED NO: 24 amino acid sequence comprising 40a-RR-HTS 
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SEQUENCE LISTING 

TKDYAQRVGTVSDTIAAIKGKVSSIiENRLSAIHQEA 
DIMAAQAKVSQIiQGKLASTLKQSDSIiNIiQVRQLNDT 
KGSLRTELLAAKAKQAQLEATRDQSIiAKLASLKAAIj 
HQTEAIiAEQAAARVTALVAKKAHLQYIiRDFKLNPNR 
LQVIRERIDNTKQDLAKTTSSLLNAQEALAAIiQAKQ 
SSLEATIATTEHQLTLIiKTLANEKEYRHLDEDIATV 
PDLQVAPPIiTGVKPIiSYSKIDTTPLVQEMVKETKQL 
Jj EASARLAAENT S LVAEAIiVGQTSEMVASNAI vs K I 
TSSITQP SSKTSYGSGSSTTSNLI SDVDESTQRAAA 
LERBHRHHH 

SEQ ED NO: 25 polynucleotide sequence comprising 40a-RR (nat) 

ATGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATAC 

TC^CGACGATAGTTOACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTGA 

CTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGAC^AAAACTACTGCTGAAATC 

CACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAATTTACACTAATACTCT 

TGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAAACAG 

AGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTGAGAACAAAAAGCTAGCATTTC^ 

GCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAAATATTGCTAAGCTCAA • 

TGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCATTAA 

GCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGACTGAA 

GAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAGCTCC 

GTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAAGTTA 

AAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTAC7^AAGAGCATGCAGATCAA 

ATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCA 

TGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGATTAATA 
GTGT AcGtcGtC AATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGATTACTT 
AGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTATCAGG 
GCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCATTCGAA 
ATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAACGTGGT 
ATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGCTATTAA 
CTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTC 

CTTTGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATTGCTAACCATCAACGCTTTAATAAGACCCCT 
ATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGCAGCGAT 
CA^GGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGGCAGCCC 
AAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGACAGCTTAAATCTCCAA 
GTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACAAGCACA 
ACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACAGAAG 
CCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCTAAGG 
GACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATACTAAGCAAGATTTGGC 
TAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAAAGCAGTCTAG 
AAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATATC^ 
CACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAACCGCT 
ATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTAGAAG 
CTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGGCCAAACCTCTGAAATG 
GTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCAT H CTAAGACATCTTATGG 
CTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCAAcGt 



SEQ ID NO: 26 amino acid sequence comprising 40a-RR (nat) 
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SEQ ID NO: 27 polynucleotide sequence comprising HIS-40a NH 

ATGG6ATCGCATCACCATCACCATCACGCTAGTAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAG 
AGCCTCAGGAGAAACGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGG 
CAAAGGCAACTATTGATGCAGTTGAAAAAACTCTCAGTCAACAT^AAGCAGAACTGACAGAGCTTGCTACC 
GCTCTGACAAAAACTACTGCTGAAATCAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAAC 
CTCTGCACAAGAAATTTACACTAATACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAAC 
A^CAAAGAGAGTTAACAGCTACTGAAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACT 
GCATTGTCAGAACAAAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAA 
AACGTCTGAACAAAATATTGCTAAGCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTC 
AAACGGCTAATGATAATACAAAAGCATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAA 
AAAGCTAAAGTTAAAAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGC 
AGAACTTAGTCGTCTTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAG 
CACCGCAAGGCTATCCTCTTGAAGAACTTAAAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTAC 
AATAATTATTACAAAGAGCATGCAGATCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATA 
CCAAGATATTCCAGCAGATCGTAATCGCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGC 
TAGCGCAGTTTGCAGCTCACATGATTAATAGTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACA 
GCAGGATCACAAGAATTTGCAAGATTACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATC 
ATTTGTCTACGGACAGCCAGGGGTATCAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAG 
ACTCTGCCGGAGCGTCAGGGCTCATTCGAAATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGAT 
GTGCATACTGTGAATGGTATTAAACGTGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTT 
ACACGGAAATACATACGGCCATGCTATTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTT^ 
ACCTTGGATTTTCAACCAGCAATGTAGGATCTTTGAATGAACACTTTGTAATGTOT 

GCTAACCATCAACGCTTTAATAAGACCCCTATAAAAGCCGTTGGAAGTACAAAAGA^TATGCCCAAAGAGT 

AGGCACTGTATCTGATACTATTGCAGCGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTA 

TTCATCAAGAAGCTGATATTATGGCAGCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACA 

CTTAAGCAGTCAGACAGCTTAAATCTCCAAGTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGA 

ATTACTAGCAGCTAAAGCAAAACAAGCACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCAT 

CGTTGAAAGCCGCACTGCACCAGACAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTG 

GCTAAAAAAGCTCATTTGCAATATCTAAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGA 

GCGCATTGATAATACTAAGCAAGATTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTO 

CAGCCTTACAAGCTAAACAAAGCAGTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTT 

AAAACCTTAGCTAACGAAAAGGAATATCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGT 

AGCTCCACCTCTTACGGGCGTAAAACCGCTATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAA 

TGGTTAAAGAAACGAAACAACTATTAGAAGCTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCA 

GAAGCGCTTGTTGGCCAAACCTCTGAAATGGTAGCAAGTAATGCCATTGTGTCTAAAATCAC^ 

TACTCAGCCCTCATCTAAGACATCTTATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTG 
ATGAAAGTACTCA AcGt 

SEQ ID NO: 28 amino acid sequence comprising HIS-40a NH 
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SEQ ED NO: 29 polynucleotide sequence comprising HIS-40a CH 

ATGGCTAGTAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAG 
TAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTG 

AAAAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGAC^AAAACTACTGCTGAA 
ATCAACCJ^TTAAAAGAGCAGCAAGATAATGAAC^ 

TACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTG 

AAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACAAAAAGCTAGC 

ATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAAATATTGCTAA 

GCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAG 

CATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTG 

ACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTC 

AGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAG 

AACTTAAAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCA 

GATCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAGCAGATCGTAA 

TCGCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCAGCTCACATGA 

TTAATAGTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTGCAAGA 
TTACTTAGTACCAGCTATAAGAAAACTCAT^ 

ATCAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCGTCAGGGCTCA 

TTCGAAATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAATGGTATTAAA 

CGTGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACATACGGCCATGC 
TATTAACTTTTTACGTGTAGATAAACAT^ 

TAGGATC TTTGAATGAAC ACTTTGTAATGTTTC C AGAGTCTAAC ATTGC TAACC ATCAAC GC TTTAATAAG 
ACCCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTGATACTATTGC 
AGCGATCAAAGGTU^AAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCTGATATTATGG 
CAGCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGACAGCTTAAAT 
CTCCAAGTGAGACAATCAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTAAAGCAAAACA 
AGCACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGA 
CAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATAT 
CTAAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATACTAAGCAAGA 
TTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAAAGCA 
GTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAA 
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SEQUENCE LISTING 

TATCGCGACTTAGACGAAGATATAGCT 
ACCGCTATCATATAGTAAGATAGATACTACTCC^ 

TAGAAGCTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGGC 
GAAATGGTAGC AAGTAATGC C ATTGTGTC TAAAATCAC ATCTTC GATTACTCAGC CCTCATCTAAGACATC 
TTATGGCTCAGGATCTTCTACAACGAGGAATC 
CACTCGAGCACCACCACCACCACCAC 



SEQ ID NO: 30 amino acid sequence comprising HIS-40a CH 
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SEQ ID NO: 31 polynucleotide sequence comprising HIS-40a-RR NH 

ATGGGATCGCATCACCATCACCATCACGCTAGTAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAG 
AGCCTCAGGAGAAACGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGG 
CAAAGGCAACTATTGATGCAGTTGAAAAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACC 
GCTCTGACAAAAACTACTGCTGAAATCAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAAC 
' CTCTGCACAAGAAATTTACACTAATACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAAC 
ATCAAAGAGAGTTAACAGCTACTGAAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACT 
GCATTGTCAGAACAAAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAA 
AACGTCTGAACAAAATATTGCTAAGCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTC 
AAACGGCTAATGATAATACAAAAGGATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAA 
AAAGCTAAAGTTAAAAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGC 
AGAACTTAGTCGTCTTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAG 
CACCGCAAGGCTATCCTCTTGAAGAACTTAAAAAATTAGAAGCTAGTGGTTATATTGGATCAGCTAGTTAC 
AATAATTATTACAAAGAGCATGCAGATCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATA 
CCAAGATATTCCAGCAGATCGTAATCGCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGC 
TAGCGCAGTTTGCAGCTCACATGATTAATAGTGTAcGtcGtCAATTAGGTCTACCACCAGTTACTGTTACA 
GCAGGATCACAAGAATTTGCAAGATTACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATC 
ATTTGTCTACGGACAGCCAGGGGTATCAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAG 
ACTCTGCCGGAGCGTCAGGGCTCATTCGAAATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGAT 
GTGCATACTGTGAATGGTATTAAACGTGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTT 
ACACGGAAATACATACGGCCATGCTATTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTT 
ACCTTGGATTTTCAACCAGCAATGTAGGATCTTTGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATT 
GCTAACCATCAACGCTTTAATAAGACCCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGT 
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AGGCACTGTATCTGATACTATTGCAGCGA^ 

TTCATCAAGAAGCTGATATTATGGCAGCCGAAGC^ 

CTTAAGCAGTCAGACAGCTTAAATCTCCAAGTG^^ 

ATTACTAGCAGCTAAAGCAAAACAAGCACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCAT 

CGTTGAAAGCCGCACTGCACCAGACAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTC 
GCTAAAAAAGCTCATTTGC^TATCTAAGG^ 

GCGCATTGATAATACTAAGCAAGATTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAG 

CAGCCTTACAAGCTAAACAAAGCAGTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTCACT^ 

AAAACCTTAGCTAACGAAAAGGAATATCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGT 

AGCTCCACCTCTTACGGGCGTAAAACCGCTATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAA 

TGGTTAAAGAAACGAAACAACTATTAGAAGCTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCA 

GAAGCGCTTGTTGGCCAAACCTCTGAAATGGTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGAT 

TACTCAGCCCTCATCTAAGACATCTTATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTG 
ATGAAAGTACTCAAcGt 



SEQ ID NO: 32 amino acid sequence comprising HIS-40a-RR NH 
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D E S T Q R 

SEQ ID NO: 33 polynucleotide sequence comprising 40N-HIS 

ATGCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATACTCACGACGATAGTTTACC 

AAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATTGATGCAGTTGAAAAAACTCTCAGTCAACAAAAAG 

CAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGCTGAAATCAACCACTTAAAAGAGCAGCAA 

GATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAATTTACACTAATACTCTTGCAAGTAGTGAGGAGAC 

GCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAAACAGAGCTTCATAATGCTCAAG 

CAGATCAACATTC7^AAAGAGACTGCATTGTCAGAACAAAAAGCTAGCATTTCAGCAGAAACTACTCGAGCT 

CAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAAATATTGCTAAGCTCAATGCTATGATT 

TGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAAAGCATTAAGCTCAGAATTGGAGAAGG 

CTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAAAAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAA 

GCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTCTTAAATCCTCAGCTCCGTCTACTCAAGATAGCAT 

TGTGGGTAATAATACCATGAAAGCACCGCAAGGCTATCCTCTTGAAGAACTTAAAAAATTAGAAGCTAGTG 

GTTATATTGGATCAGCTAGTTACAATAATTATTACAAAGAGCATGCAGATCAAATTATTGCCAAAGCTAGT 

CCAGGTAATCAATTAAATCAATACCAAGCGGCCGCACTCGAGCACCACCACCACCACCACCAC 
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SEQ ID NO: 34 amino acid sequence comprising 40N-HES 

MQVKADDRASGETKASNTHDDSLPKPETIQEAKATI 
DAVEKTLSQQKAELTELATALTKTTAEINHLKEQQD 
NEQKALTSAQEIYTNTLASSEETLLAQGAEHQRELT 
ATETELHNAQADQHSKETALSEQKASISAETTRAQD 
LVEQVKTSEQNIAKLNAMI SNPDAITKAAQTANDNT 

KALSSELEKAKADLENQKAKVKKQLTEELAAQKAAIi 
AEKEAELSRLKSSAPSTQDSIVGNNTMKAPQGYPLE 
ELKKIiEASGYI GSASYNNYYKEHADQI IAKASPGNQ 
LNQYQAAALEHHHHHHH 

SEQ ID NO: 35 amino acid sequence comprising GAS 117 

MTLKKHYYLLSLLALVTVGAAFNTS Q S VS AQVYSTTOnvwnwT .TTypir ottt .nv qtt rvivj^ qt jpyyf T ,p^| yo^ynXj 
GRHYS S YYYYNLRTVMGLS SEQDI EKHYEELKNKLHDMYNHY 

SEQ ID NO: 36 polynucleotide sequence encoding GAS 117 

ATGACACTAAAAAAACACTATTATCTTCTCAGCCTGCTAGCTCTTGTAACGGTTGGTGCTGCCTTTAACAC 
AAGCCAGAGTGTCAGTGCACW^GTTTATAGCAATGAAGGGTATCACCAGCATTTGACTGATGAAAAATCAC 
ACCTGCAATATAGTAAAGACAACGCACAACTTCAATTGAGAAATATCCTTGACGGCTACCAAAATGACCTA 
GGGAGACACTACTCTAGCTATTATTACTACAACCTAAGAACCGTTATGGGACTATCAAGTGAGCAAGACAT 
TGAAAAACACTATGAAGAGCTTAAGAACAAGTTACATGATATGTACAATCATTATTAA 

SEQ ID NO: 37 amino acid sequence comprising GAS 117 leader sequence 

TLKKHYYLL SLLALVTVGA 

SEQ ID NO: 38 amino acid sequence comprising fragment of GAS 1 17 where leader sequence 
is removed 

AFNTSQSVSAQVYSNEGYHQHLTDEKSHLQYSKDNAQLQ 
EQD I EKHYEELKNKLHDMYNHY 

SEQ ID NO: 39 amino acid sequence comprising GAS 130 

MS HMKKRPEVLS PAGTLEKLKVAI DYGADAVFVGGQAYGLRSRAGNF SMEELQEGIDYAHARGAKVYVAAN 
MVTHEGNE IGAGEWFRQLRDMGLDAVIVSD PALI VIC STEAPGLE IHLSTQASSTIJYETF EFWKAMGLTRV 
VLAREVNMAELAEI RKRTDVEI EAFVHGAMC I S YSGRCVLSNHMSHRDANRGGCSQSCRWKYDLYDMPFGG 

ERRSIiKGEIPEDYSMSSVDMCMIDHIPDLIENGVDSLKIEGRMKSIHYVSTVTNCYKAAVGAYMESPEAFY 
AI KEELIDELWKVAQRELATGF YYGI PTENEQLFGARRKI PQYKFVGEWAFDS ASMTAT I RQRNVTMEGD 

RIECYGPGFRHFETVVKDLHDADGQKIDRAPNPMELLTISLPREVKPGDMIRACKEGLV^ 

SEQ ID NO: 40 polynucleotide sequence encoding GAS 130 

ATGTCACATATGAAAAAACGTCCCGAGGTCTTATCACCTGCTGGAACACTTGAAAAATTAAAAGTTGCGAT 

TGACTATGGCGCAGATGCTGTTTTTGTTGGAGGGCAGGCCTATGGCCTAAGAAGCCGCGCTGGTAACTTCT 

CTATGGAAGAATTGCAAGAAGGCATTGATTATGCACATGCGCGTGGAGCTAAGGTCTATGTTGCTGCTAAC 

ATGGTTACCCACGAAGGGAACGAAATTGGTGCGGGCGAGTGGTTTCGTCAACTGCGTGATATGGGGCTTGA 

TGCGGTCATTGTTTCAGATCCAGCCTTGATTGTTATTTGTTCAACAGAAGCCCCAGGTTTGGA 
TGTCAACGCAAGCTTCATCTACCAATTACGAGACCTTTC 

GTTTTAGCTCGCGAGGTTAATATGGCCGAGTTAGCAGAAATCCGCAAGCGGACAGATGTGGAAATTGAAGC 

CTTTGTCCATGGAGCCATGTGTATCTCTTATTCAGGCCGCTGTGTTTTGTCAAACCACATGAGTCACCGTG 

ATGCCAACAGGGGCGGCTGCTCACAGTCTTGCCGCTGGAAGTATGATTTGTATGACATGCCATTTGGAGGA 

GAGCGCCGCTCCTTAAAAGGGGAAATTCCAGAAGACTATTCTATGTCCTCTGTTGACATGTGTATGATTGA 

CCATATTCCTGACCTGATTGAAAATGGGGTTGATAGCTTAAAAATTGAAGGCCGAATGAAATCTATCCACT 
ACGTCTCAACCGTAACCAACTGTTACAAGGCGGCTG^ 

GCTATCAAAGAGGAATTGATTGACGAGTTGTGGAAGGTTGCCCAGCGCGAGTTGGCTACAGGTTTTT 

TGGTATCCGAACTGAAAATGAACAATTATTTGGTGCTCGCCGCAAAATTCCACAATATAAATTT 

AAGTAGTTGCCTTTGACTCAGCTAGCATGACAGCGACCATTCGTCAGCGTAATGTCATCATGGAAGGCGAT 
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CGGATTGAATGTTATGGACCAGGTTTCCGTC^ 
CCAAAAGATTGACCGTGCCCGAAATCCAATGGAACTCTO 
GGGATATGATTAGGGCTTGCAAGGAAGGTCTGGTTA^^ 
AGAACATAG 

SEQ ID NO: 41 amino acid sequence comprising GAS 277 

MTTMQKT I SLLSLALL I GLLGTSGKAI SVYA QDQHTDNVIAESTI SQVSVEASMRGTEPYIDATVTTDQPV 

RQPTQATITLKDASDOTINSWTTMA^^ 

KARKTPTl^QQKDTSKAMTNSVDVDTKAQTNQSANQ^ 

SQKNGSNKTKMLVDKEEVKPTSKRGFPWVLLGLVVSliAAGLFIAIQKVSRRK 

SEQ ID NO: 42 polynucleotide sequence encoding GAS 277 
atgacaactatgcaaaaaacaattagcttattatc^ 

caaagccatatctgtgtatgcacaagatcagcacactgataatgttatagctgaatcaactattagtcagg 

tcagtgttgaagccagtatgcgtggaacagaaccttatattgatgctacagtcaccacagatcaacctgtc 

agacaaccaactcaggcaacgataacacttaaagacgctagtgataatactattaatagttgggtatatac 

tatggcagcgcaacagcgtcgttttacagcttggtttgatttaactggacaaaagagtggtgact 

taactgtcaccgttcatactcaagaaaaggcagtaactggtcaatcaggaactgttcattttgatcaaaac 

aaagctagaaaaacaccaactaatatgcaacaaaaggatacttctaaagcaatgacgaattcagtcgatgt 

agacacaaaagctcaaacaaatcaatcagctaaccaagaaatagattctacttcaaatcctttcagatcag 

ctactaatcatcgatcaacttccttaaagcgatctactaaaaatgagaaacttacaccaactgctagtaat 

agccaaaaaaacggtagcaacaagacaaaaatgctagtggacaaagaggaagtaaaacctacttcaaaaag 

aggattcccttgggtcttattaggtctagtagt^ 

tatctagacgaaaataa 

. SEQ ID NO: 43 amino acid sequence comprising N-terminal leader sequence of GAS 277 

TTMQKTI SLL SIjALIiIGLIiGTSGKAI S VYA 

SEQ ID NO: 44 amino acid sequence comprising fragment of GAS 277 where N-terminal leader 
sequence is removed 

QDQHTDNVI AE STI S QVSVEASMRGTEP YI DATVTTDQPVRQ PTQATITLKDASDNTINSWVYTMAAQQRR 
FTAWFDLTGQKSGDYHVTVTVHTQEKAVTGQSGTVHFDQNKARKTPTNM 
QSANQEIDSTSNPFRSATNHRSTSLKRSTKNEKLTPTASNSQKNGSl^T 
GLWSLAAGLFIAIQKVSRRK 

SEQ ID NO: 45 amino acid sequence comprising GAS 236 

MTQMNYTGKVKRVAI I ANGKYQSKRVASKLFSVFKDDPDFYLSKKNPDIVI S IGGDGMIiLSAFHMYEKELD 
KVRFVGIHTGHIiGFYTDYRDFEVDKLIDNIiRKDKGEQI SYPILKVAITLDDGRVVKARALNEATVKRI EKT 
MVADVIINHVKFESFRGDGISVSTPTGSTAYNKSLGGAVIiHP 

KDKIELVPKRLGIYTISIDNKTYQLKNVTKVEYFIDDEKIHFVSSPSHTSFWERVKDAFIGEIDS 

« 

SEQ ID NO: 46 polynucleotide sequence encoding GAS 236 

ATGACACAGATG AATTATACAGGTAAGGTAAAACGAGTTGCTATTATTflH A A ATCCT A anTarn a a acta a 
ACGCGTCGCCTCCAAACTTTTCTCCGTATTTAAAGATGATCCTGATTTCTATCTTTCAAAGA^ 
ATATTGTGATTTCTATTGGCGGAGATGGGATGCTCTTATCTGCCTTTCACATGTATGAAAAAGAATTAGAT 
AAGGTACGTTTTGTAGGAATCCACACCGGTCATCTTGGCTTTTATACCGATTATAGGGATTTTGAAGTTGA 
TAAATTAATTG ATAATTTAAGAAAAG AC AAGGGAGAAC AAATC TC TT ATC C G ATTTT AAAAGTTGC T ATTA 

CTTTAGATGATGGTCGTGTGGTTAAAGCGCGTGCTTTGAATGAAGCGACGGTTAAGCGTATTGAAAAAA 
ATGGTAGCAGATGTTATTATTAACCATGTCAAAT™ 

CCCGACAGGGAGCACAGCCTACAATAAATCTTTAGGTGGTGCTGTCTTGCATCCGACGATTGAAGCGCTGC 
AATTGACGGAAATTTCCAGTCTTAATAACCGTGTC 

AAAGATAAGATTGAGTTAGTGCCAAAACGATTAGGAATTTATACCATTTCCATTGATAATAAAACCTATCA 
GTTAAAAAATGTGACGAAGGTGGAGTATTTTATCGACGATGAGAAAATTCATTTTGTTTCCTCTCCGAGTC 
ATACGAGCTTTTGGGAAAGGGTCAAGGATGCCTTTATTGGAGAGATTGACTCATGA 
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SEQ ID NO: 47 amino acid sequence comprising N-terminus leader. sequence of GAS 236 

MTQM 

SEQ ED NO: 48 amino acid sequence comprising a fragment of GAS 236 where the N-terminal 
leader sequence is removed 

NYTGKVKRVAIIANGKYQSKRVASK^ 

VG IHTGHLGFYTDYRDFEVDKLIDNLRKDKGEQ I S YP ILKVAITIjDBGRVVKARAIiNEATVKRI EKTMVAD 

VIINHVKFESFRGDGISVSTPTGSTAYNKSL^^^ 

ELVPKRLGIYTISIDNKTYQLKNVTKVEYF^ 

■ 

SEQ ID NO: 49 amino acid sequence comprising GAS 389 

MRNEMAKIMNVTC 

AVTVACGF LHDWEDTD ITLDE I EADFGHDARD IVDGVTKLGEVEYKS HEEQLAENHRKMLMAMS KDX RVI 
LVKLADRLHNMRTLKHLRKDKQERI SRETMEI YAPLAHRJjGISRIKWEIjEDLAFRYIiNETEFYKI SHMMKE 
KRREREALVEAIVSKVKTYTTQQGLFGDVYGRPKHIYSIYRKMRDKKKRFDQIFDIiIAIR 
mvgyihelwrpmpgrfkdyiaapkangyqs IHTTVYGPKGPI EI QI RTKDMHQVAEYGVAAHWAYKKGVRG 

kwqaeqavgmnwikelvelqdas^^ 

Q IGEKATGAKVNGRMVPLiTAKLKTGDWEI itnansfgpsrdwvkdvktjtecarnki rqffknqdkelsvi^ 

GRDLLVS YFQEQGYVANKYLDKKRI EAILPKVS VKSEE SLYAAVGFGDI S P I SVFNKLTEKERREEERAKA 

KAEAEELVKGGEVKHENKDVLKVRSENGVIIQGASGLLMRIAKCCNPVPGDPIDGYITKGRG 
NIKSQDGYQERLI EVEWDLDNS SKDYQAEIDI YGLNRSGIiLNDVLQILSNSTKS I STVNAQPTKDMKFANI 
HVS FGI PNLTHLTTWEKI KAVPDVYS VKRTNG 

SEQ ID NO: 50 polynucleotide sequence encoding GAS 389 

ATGAGGAACGAAATGGCAAAAATAATGAACGTAACAGGAGAAGAAGTCATTGCCTTAGCGGCCACCTATAT 
GACCAAGGCTGATGTGGCTTTTGTGGCAAAGGCTTOAGCATATGCAACAGCGGCCCATTTCTACCAAGTGA 
GAAAGTC AGGC GAAC CCTATATCGTCCATCCGATTC AGGTGGCGGGGATTC TGGCTGATTTGCATCTGGAT 

GCTGTGACAGTTGCTTGTGGCTTTTTACATGATGTCGTAGAAGATACGGATATTACCTTAGATGAGATCGA 
AGCAGACTTTGGCCATGATGCTCGTGATATCGTTGATGGTGTCACCAAGTTAGGTGAAGTTGAGTACAAAT 
CTCATGAGGAGCAACTCGCCGAAAACCATCGCAAAATGCTGATGGCTATGTCCAAAGATATTCGCGTGATT 
TTGGTGAAATTGGCTGACCGCCTGCATAATATGCGCACCCTCAAACATTTGCGCAAGGACAAACAAGAGCG 

catttcgcgcgaaaccatggaaatctatccccccttggcgcatcgtttggggattagtcgcatcaaatc 

aactagaagatttggcttttcgttacctcaatgaaaccgaattttacaaaatttcccatatgatgaaagaa 

aaacgtcgcgagcgtgaagctttggtagaggctattgtcagtaaggtcaaaacctatacgacacaacaagg 
gttgtttggagatgtgtatggccgaccaaaacacatttat 

aacgattcgatcagatttttgatctgattgccattcgttgtgtcatggaaacgcaaagcgatgtctatgct 

atggttggctatattcatgagctttggcgtcccatgccaggccgcttcaaggattatattgcagctcctaa 

agctaatggctaccagtctattcataccaccgtgtatgggccaaaaggacctattgagattcaaatcagaa 

ctaaggacatgcatcaagtggctgagtacggggttgctgctcactgggcttataaaaaaggcgtgcgtggt 

aaggtcaatcaagctgagcaagccgttggcatgaactggatcaaagagctggtagaattgcaagatgcctc 

aaatggcgatgcagtggactttgtggattcggtcaaagaagacattttttctgaacggatttatgtcttta 

caccgacaggggccgttcaggagttacca7y\agaatcaggtcctattgattttgcttatgcgatccatacg 

caaatcggtgaaaaagcaacaggtgccaaagtcaatggacgtatggttcctctcactgccaagttaaaaa^ 

aggagatgtggttgaaatcatcaccaatgccaattcctttggccctagtcgagactgggtaaaactggtca 

■ aaaccaataaggctcgcaacaaaattcgtcagttctttaaaaatcaagacaaggaattgtcagtgaataaa 
ggccgtgatttgttggtgtcttattttc^ 

cattgaagccatccttccaaaagtcagtgtgaagagcgaagaatcactctatgcagccgttgggtttggtg 
acattagtcctatcagtgtctttaacaagttaaccgaaaaagagcgccgtgaagaagaaagggccaaggct 
aaagcagaagctgaagaattggttaagggcggtgaggtcaaacacgaaaacaaagatgtgctcaaggttcg 
cagtgaaaatggagtcattatccaaggagcatcaggcctcttgatgcggattgccaagtgttgtaatcctg 

TACCTGGTGATCCTATTGACGGCTACATTACCAAAGGGCGTGGCATTGCGATTCACAGATCGGACTGTCAT 
AAC ATTAAGAGTC AAGATGGCTAC C AAGAAC GCTTGATTGAGGTCG AGTGGGATTTGGAC AATTC GAGTAA 

AGATTATCAGGCTGAAATTGATATCTATGGGCTCAATCGTAGTGGTCTGCTTAATGATGTGCTCCAAATTT 

TATCAAACTCAACCAAGAGCATATCGACAGTCAATGCTCAGCCGACCAAGGACATGAAGTTTGCTAATATT 

CACGTGAGCTTTGGCATTCCAAATCTGACGCATCTGACCACTGTTGTCGAAAAAATCAAGGCAG 
TGTTTATAGCGTGAAGCGGACCAATGGCTAA 
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0 

SEQ ID NO: 51 amino acid sequence comprising GAS 504 

MKTRITELLNIDYPI FQGGMAWVADCffiLAGAVSNAGGLGI IGGGNAPKEVVKANIDRVKAITDRPFGVNIM 
LLSPFADD IVDIiVI EEGVKWTTGAGNPGKYMERIjHQAGI IWPWP SVALAKRMEKLGVDAVI AEGMEAG 
GHIGKLTTMSLVRQVVEAVSIPVIAAGGIADGHG 
KDIDT^SAQWGHPVRSIKI^TSAYAKAEKAFIjI^ 
AGLVRKEESCET IIjKDI YYGAARVIQNEAKRWQSVS I EK 

♦ 

SEQ ED NO: 52 polynucleotide sequence encoding GAS 504 

ATGAAAACACGTATTACAGAATTACTTAATATTGATTACCCCATTTTTCAAGGAGGAATGGCTTGGGTTGC 

TGATGGTGATTTAGCAGGTGCAGTTTCTAATGCTGGTGGTTTAGGCATTATAGGT 

AAGAAGTCGTTAAAGCTAATATTGATCGTGTCAAAGCTATTACTGAT 

CTTTTATCTCCTTTTGCTGATGATATCGTTGATCTGGTCATTGAAGAAGGTGTTAAAGTAGTAACAACAG^ 

CGCAGGAAATCC^GGAIUVGTATATGGAAAGACTGCACCAGGCGGGTATAATCGTTGTTCCTGTTGTCCC^ 

GCGTTGCGCTAGCCAAACGTATGGAAAAGCTTGGGGTAGATGCTGTTATTGCTGAGGGTATGGAAGCTGGA 

GGACATATTGGCAAGTTAACGACTATGTCTTTAGTAAGACAAGTTGTTGAAGCGGTTTCGATTCCTGTCAT 

TGCGGCAGGTGGTATAGCTGATGGTCATGGTGCAGCAGCAGCATTTATGTTAGGAGCAGAGGCTGTTCAAA 

TTGGAACTCGCTTTGTTGTTGCTAAAGAATCCAATGCTCACC^^ 

AAAGATATTGATACGGTGATTTCTGCGCAGGTTGTGGGCCACCCTGTCCGTTC 

CTCAGCTTACGCTAAAGCAGAAAAAGCATTTTTAATT 

GAGCAGGATCGCTTCGACACGCTGTTATTGAAGGCGATGTAGTCAATGGATCTGTTATGGCTGGCCAAATT 

GCAGGGCTTGTGAGAAAAGAAGAAAGCTGTGAAACGATTTTAAAAGATATTTATTATGGT 

TATTCAAAATGAAGCTAAGCGCTGGCAATCTGTTTCAATAGAAAAGTAG 

SEQ ID NO: 53 amino acid sequence comprising GAS 509 

MTKI YKTITELVGQTP I 1 KLNRL I PNEAADVYVKLEAFNPGS SVKDRI ALSMI EAAEAEGLI S PGDVI IE 
PTSGNTGIGLAWVGAAKGYRVI IVMPETMSLERRQ 1 1 QAYGAEIiVLTPGAEGMKGAI AKAETLAI ELGAW 
MPMQFNNPANPS I HEKTTAQE I LEAFKEI SLDAFVSGVGTGGTLSGVSHVLKKANP ETVI YAVEAEES AV 
LSGQEPGPHKI QGI SAGFI PNTLDTKAYDQI IRVKSKDALETARLTGAKE GFLVGI S SGAAIiYAAIEVAK 
QLGKGKHVLTILPDNGERYLSTELYDVFVTKTK 

SEQ ID NO: 54 polynucleotide sequence encoding GAS 509 

. ATGACTAAAATTTACAAAACTATAACAGAATTAGTAGGTCAAACACCTATTATCAAACTTAACCGTTTAA 
TTCCAAACGAAGCTGCTGACGTTTATGTAAAATTAGAAGCTTTTAACCCAGGATCTTCTGTTAAAGATCG 
TATTGCTTTATCGATGATTGAAGCTGCTGAAGCTGAAGGTCTGATAAGTCCTGGTGACGTTATTATCGAA 
CCAACAAGTGGTAATACAGGTATTGGTCTTGCATGGGTAGGTGCTGCTAAAGGGTATCGAGTCATTATTG 
TTATGCCCGAAACTATGAGCTTGGAAAGACGGCAAATCATTCAGGCTTATGGTGCAGAGCTTGTCTTAAC 
ACCTGGAGCAGAAGGTATGAAAGGGGCTATTGCAAAAGCTGAAACTTTAGCAATAGAACTAGGTGCTTGG 
ATGCCTATGCAATTTAATAACCCTGCCAATCCAAGCATCCATGAAAAAACAACAGCTCAAGAAATTTTGG 
AAGCTTTTAAGGAGATTTCTTTAGATGCATTCGTATCTGGTGTTGGTACTGGAGGAACACTTTC 
TTCACATGTCTTGAAAAAAGCTAACCCTGAAACTGTTATCTATGCTGTTGAAGCTGAAGAATCTGCTGTC 
TTATCTGGTCAAGAGCCTGGACCACATAAAATTCAAGGTATATCAGCTGGATTTATCCCAAACACGTTAG 
ATACCAAAGCCTATGACCAAATTATCCGTGTTAAATCGAAAGATGCTTTAGAAACTGCTCGACTAACAGG 
AGCTAAGGAAGG CTTCCTGGTTGGGATTTCTTCTGGAGCTGCTCTTTACGCCGCTATTGAAGTCGCTAAA 
CAGTTAGGAAAAGGCAAA-CATGTGTTAACTATTTTACCAGATAATGGCGAACGCTATTTATCGACTGAA^ 
TCTATGATGTACCAGTAATTAAGACGAAATAA 

SEQ ID NO: 55 amino acid sequence comprising C-terminus transmembrane region of GAS 
509 

FLVGI S SGA&LYAAI EVAKQLGKGKHVLT ILPDNGERYLSTELYDVPVT KTK 

SEQ ID NO: 56 amino acid sequencing comprising a fragment of GAS 509 where the C- 
terminal transmembrane region is removed 

MTKI YKTITELVGQTPI IKLNRLI PNEAADVYVKLEAFNPGSSVKDRI ALSMIEAAEAEGLISPGDVIIEP 
TSGNTGIGLAWVGAAKGYRVT IVMPETMSLERRQI I Q A YGAELVLTPGAEGMKGAI AKAETLAI ELGAWMP 
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SEQUENCE LISTING 

MQFNNPANPSIHEKTTAQEILEAFKEISLDAFVSGVGTGGTL 
QEPGPHKIQGI SAGFI PNTLDTKAYDQI IRVKSKDALETAKLTGAKEG 

■ 

SEQ ID NO: 57 amino acid sequence comprising GAS 366 

MKVISNFQNKKILILGLAKSGE&AAK LLTKL^ 

NFEYMVKNPGI PYDNPMVKRALAKEI P ILTEVELAYFVSEAP 1 1 GITGSNGKTTTTTMI ADVLNAGGQSAL 
LSGNIGYPASKWQKAXAGDTLVMELSSFQLVGVNAFRPHI^ 

MTESDYLILNANQEI SATLAKTTKATVIPFSTQKWDGAYLKDGILYFKEQAI I AATDLGVPGSHNIENAL 

ATIAVAKLSGIADDIIAQCIiSHFGGVKHRLQRVGQIKDITFYNDSKSTNiriATQK^ 

LDRGNEFDDLVPDLLGLKQMIILGESAERMKRAANKAE^ 

SWDMYPNFEVRGDEFLATFDCLRGDA 

SEQ ID NO: 58 polynucleotide sequence encoding GAS 366 

ATGAAAGTGATAAGTAATTTTCAAAACAAAAAAATATTAATATTGGGGTTAGCCAAATCGGGCGAAGCAGC 

AGCAA AATTATTGACGAAACTTGGTGCTTTAGTGACTGTTAATGATAGTAAACCATTTGACCAAAATC 

CGGCACAAGCCTTGTTGGAAGAGGGGATTAAGGTCATTTGTGGTAGCCACCCAGTAGAATTATTAGATGAG 

AACTTTGAGTACATGGTTAAAAACCCTGGGATTCCTTATGATAATCCTATGGTTAAACGCGCCCTTGCAAA 

GGAAATTCCCATCTTGACTGAAGTAGAATTGGCTTATTTC 

GATCAAACGGGAAGACAACCACAACGACAATGATTGCCGATGTTTTGAATGCTGGCGGGCAATCTGCACTC 
TTATCTGGAAACATTGGTTATCCTGCTTCAAAAGTT 

GGAATTGTCCTCTTTTCAATTAGTGGGAGTGAATGCTTTTCGCCCTCATATTGCTGTCATCACTAATTTAA 

TGCCGACTCACCTGGACTATCATGGCAGTTTTGAGGATTATGTTGCTGCTAAATGGATGATTCAAGCTCAG 

ATGACAGAATCAGACTACCTTATTTTAAATGCTAATCAAGAGATTTCAGCAACTCTAGCTAAGACCACCAA 

AGCAACAGTGATTCCTTTTTCAACTCAAAAAGTGGTTGATGGAGCTTATCTGAAGGATGGAATACTCTA 

TTAAAGAACAGGCGATTATAGCTGCAACTGACTTAGGTGTCCCAGGTAGCCACAACATTGAAAATGCCCTA 

GCAACTATTGCAGTTGCCAAGTTATCTGGTATTC5CTGATGATATTATTGCCCAGTGCCTTTCACATTTTGG 

AGGCGTTAAACATCGTTTGCAACGGGTTGGTCAAATCAAAGATATTACCTTCTACAATC 

CCAATATTTTAGCCACTCAAAAAGCTTTATCAGGTTTTGATA 

CTAGATCGTGGCAATGAATTTGACGATTTGGTGCCAGACCTTTTAGGACTTAAGCAGATGATTATTTTGGG 

AGAATCCGCAGAGCGTATGAAGCGAGCTGCTAACAAAGCAGAGGTCTCTTATCTTGAAGCTAGAAATGTGG 

CAGAAGCAACAGAGCTTGCTTTTAAGCTGGCCCAAACAGGCGATACTATCTTGCTTAGCCCAGCCAATGCT 

AGCTGGGATATGTATCCTAATTTTGAGGTTCGTGGGGATGAATTTTTGGCAACCTTTGATTGTTTAAGAGG 
AGATGCCTAA 

SEQ ID NO: 59 amino acid sequence comprising N-terminal leader sequence of GAS 366 

MKVI SNFQNKKILILGLAKSGEAAA 

SEQ ID NO: 60 amino acid sequence comprising a fragment of GAS 366 where the N-terminal 
leader sequence is removed 

KLLTKLGALVTVXTOSKPFDQNPAAQALLEE 

I P ILTEVELAYFVSEAP I IGITGSNGKTTTTTMI ADVLNAGGQ S ALL SGNIGYP ASKVVQKAI AGDTLVME 

LS S FQLVGVNAFRPH I AVI TNLMPTHLDYHGS FEDYVAAKWMI QAQMTESDYL ILNANQEI SATLAKTTKA 

TVIPFSTQKVVDGAYLKDGILYFKEQAIIAATDLGVPGSHNIENALATIAVAKLSGIADDIIAQCL 

VKHRLQRVGQIKDITFYNDSKSTOTLATQKALSGFDNSRLILIAGGLDRGNEFDDLVPDLLGLKQMIILGE 

SAERMKRAANKAWSYLEARWAEATEIAFKLAQ^^^ 

A 

SEQ ID NO: 61 amino acid sequence comprising GAS 159 

MRKLYSFLAGVLGVIVILTSLSFIL QKKSGSGSQSDKLVIYI^GDYIDPALLKKFTKETGIEVOYETFDSN 

EAMYTKIKQGGTTYDIAVPSDYTIDKMIKENLLNKLDKSKLVGMDNIGKEFLGKSFDPQNDYSLPYFWGTV 

GIVYNDQLVDKAPMHWEDLVmPEYKNSIMLI^ 

KAIVADEMKGYMIQGPAAIGITFSGEASEMLDSNEm^ 

INRPENAAQNAAYIGYATPNKKAKALLPDEIKftTOPAFYPTDDIIKK^ 
RK 
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SEQ ID NO: 62 polynucleotide sequence encoding GAS 159 

ATGCGTAAACTTTATTCCTTTC 

CTTGCAGAAAAAATCGGGTTCTGGTAGTCAA 

ATCCAGCTTTGCTCAAAAAATTCACCAAAGAAACGGGCA 

gaagccatgtacactaaaatcaagk:agggc 

TGATAAAATGATCAAAGAAAACCTACTCAATAAGCTTGATAAGTCAAAATTAGTTGGCATGGATAATATCG 
GGAAAGAATTTTTAGGGAAAAGCTTTGACCCACA^ 

GGGATTGTTTATAATGATCAATTAGTTGATAAGGCGCCTATGCACTGGGAAGATCTGTGGCGTCCAGAATA 
TAAAAATAGTATTATGCTGATTGATGGAGC GC GTGAAATGC TAGGGGTTGGTTTAAC AAC TTTTGGTTAT A 

GTGTGAATTCTAAAAATCTAGAGCAGTTGCAGGCAGCCGAGAGAAAACTGCAGCAGTTGACGCCGAATGTT 
AAAGCCATTGTAGCAGATGAGATGAAAGGCTACATGATTCAAGGTGACGCTGCTATTGGAATTACCTTTT 
TGGTGAAGCCAGTGAGATGTTAGATAGTAACGAACACCTTCACTACATCGTGCCTTCAGAAGGGTCTAACC 
TTTGGTTTGATAATTTGGTACTACCAAAAACCATGAAACACGAAAAAGAAGCTTATGCTTTTTTGAACTTT 
ATC AATCGTC CTGAAAATGCTGC GC AAAATGCTGCATATATTGGTTATGCGACACCAAATAAAAAAGCCAA 

GGCCTTACTTCCAGATGAGATAAAAAATGATCCTGCTTTTTATCCAACAGATGACATTATCAAAAAATTGG 
AAGTTTATGACAATTTAGGGTCAAG ATGGTTGGGGA 

CGCAAATAA " " — 

SEQ ID NO: 63 amino acid sequence comprising N-terminal leader sequence of GAS 159 

MRKLYSFLAGVLGVIVILTSLSFI 

SEQ ID NO: 64 amino acid sequence comprising a fragment of GAS 159 where the N-terminal 
leader sequence is removed 

LQKKSGSGSQSDKLVIYIWGDYIDPALLKKFTKETGIEVQYET^ 

DKMI KENLLNKLDKSKLVGMDNIGKEFLGKS FDPQNDYSDP YFWGTVGIVYNDQLVDKAPMHWEDLWRPEY 
KNSIMDIDGiUlEMLGVGLTTFGYSVNSKNL^ 

GEASEMLDSNEHLHYIVPSEGSNLWFDNLVLPKTMKHEKEAYAFLNFINRPENAAQ 
ALL PDEI KNDPAFYPTDDI I KKLEVYDNLGS RWLGI YNDLYLQFKMYRK 

SEQ ID NO: 65 amino acid sequence comprising C-terminal hydrophobic sequence of GAS 159 

WLG I YNDLYLQFKMYRK 

« 

SEQ ID NO: 66 amino acid sequence comprising a fragment of GAS 159 where the C-terminal 
hydrophobic region is removed 

MRKLYSFDAGVLGVIVILTSLS^ 

EAMYTKI KQGGTTYDIAVP SDYTIDKMIKENIiLNKLDKSKLVGMDNIGKEFIrGKS FD PQNDYSIiPYFWGTV 
GIVYlTOQDVDKAPMHWEDLWRPEYKNSIMLIDGAREMLGVGIiTO 

KAI VADEMKGYMIQGDAAIGITF SGEASEMLD SNEHLHYI VPS EGSNLWFDNLVL PKTMKHEKEAYAFLNF 
INRPENAAQNAAYIGYATPNKKAKALLPDEIKNDPAFYPTDDIIKKLEVYDNIjGSR 

SEQ ID NO: 67 amino acid sequence comprising a fragment of GAS 159 where the N-terminal 
leader sequence and the C-terminal hydrophobic region is removed 

LQKKSGSGSQSDKLVIYNWGDYIDPAIjLKKFTC 
DKMIKENLLNKLDKSKLVGltoNIGKEFLGKSF 
KNS IKLIDGAREMLGVGLTTFGY 

GEASEMLD SNEHLHYIVPSEGSNLWFDNLVL PKTMKHEKEAYAFLNF I NRPENAAQNAAYIGYATPNKKAK 
ALL PDEI KNDPAFYPTDDI I KKLEVYDNLGSR 

SEQ ID NO: 68 amino acid sequence comprising GAS 217 

MAQRI I VITGASGGLAQAIVKQLPKEDSLILLGRNKERLEHC YQHIDNKECLELDITNPVAI EKMVAQIYQ 
RYGRIDVLimAGYGAFKGFEEFSAQEIADMFQVNTLASIHFACLIGQKMAEQGQGHLINIVSMAGLIASA . 
KSSIYSATKFALIGFSNALRLELADKGVYVTTVNPGPIATKFFDQADPSGHYLES^ 
S 1 1 GKNKRELNLPFSLAVTHQF YTLF PKLSDYLARKVFNYK 

♦ 

SEQ ID NO: 69 polynucleotide sequence encoding GAS 217 
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ATGGCACAAAGAATCATTGTTATCACGGGAGCTTCTGGAGGACTGGC 
CAAGGAAGACAGCTTGATTTTATTAGGACGTAACAAAGAACGC^ 

ACAAAGAATGCCTCGAGTTGGATATTACCAATCCAGTAGCCATTGAGAAAATGGTCGCCCAGATTTA 

C GC TATGGCC GTATTGATGTCTTGATTAATAATGCTGGCTAC GGAGCTTTCAAAGGCTTTGAAGAGTTTTC 

TGCCCAAGAAATAGCTGATATGTTTCAGGTTAACACCCTAGCGAGCAT^ 

AGAAAATGGCAGAGCAGGGGCAAGGTCACCTTATTAATATTGTGTC^ 

AAATC GAGC ATTTATTCAGC C ACC AAGTTTGCCCTTATCGGATTTTC C AATGCCCTTC GCTTAGAATTAGC 

GGATAAAGGGGTTTACGTGACCACCGTGAATCCAGGTCCCATTGC^ 

CGTCTGGACATTATTTGGAAAGCGTTGGTAAATTTACTCTCCAACCAA 

TCTATTATCGGGAAAAATAAACGAGAATTGAATTTGCCCTTTAGTTTAGCGGTC 

CCTTTTCC C TAAATTATCTGATTATCTTGC AAGAAAGGTATTTAATTATAAATGA 

SEQ ID NO: 70 amino acid sequence comprising GAS 309 

MIEKYLESSIESKCQLIVIjFFKTSYLiPITWAEKTGLTFIiQIiNHyCEELNAFFPGSLSMTIQKRMISCQFT 

HPFKETYIjYQLYAS SNVLQLIjAFIiIKNGSHSRPLTDFARSHFLSNS SAYRMREAIiIPIjIjRNFELKLSKNKI 

VGEEYRIRYLIALL,YSKFGIKVYDLiTQQDKOTra 

WIPQTRIFQQLKKLFVYDSLKKSSHDIIETYCQLNFSAGD^^ 

LFEENDTFRLLLNPIITLLP^ 

IVEEWMAKLPGKRDLNHKHFHLFCHYVEQSLIOT 

YDLQDNVYQI PDLKPDLVITHSQLI PFVHHELTKGI AVAEI SFDES ILS I QELMYQVKEEKFQADLTKQLT 

» 

SEQ ID NO: 71 polynucleotide sequence encoding GAS 309 

TTGATAGAAAAATACOTGGAATCATCAATCGAATCAAAATGTCA 

TTATTTGCCAATAACTGAGGTAGCAGAAAAAACTGGCTTAACCTTTTTACAACTAAACCATTATTGTGAGG 
AACTGAATGCCTTTTTCCCTGGTAGTCTGTCTATGACCATCCAAAAAAGGATGATATCTTGCCAATTTACA 
CATC C TTTTAAAGAAACTTATCTTTAC C AACTCTATGC ATC ATCTAATGTC TTAC AATTAC TAGC CTTTTT 
AATAAAAAATGGTTCCCACTCTCGTCCCCTTACGGATTTTGCAAGAAGTCATT^ 

CTTATCGGATGCGCGAAGCATTGATTCCTTTATTAAGAAACTTTGAATTAAAACTCTCTAAGAACAAGATT 
GTCGGTGAGGAATATCGCATCCGTTACCTCATCGCTCTGCTATATAGTAAGTTTGGCATTAAAGTTTATGA 
CTTGACGCAGCAAGACAAAAACACTATTCATAGCTTTTTATCCCATAGTTCCACCCACCTTAAAACCTCTC 
CTTGGTTATCGGAATC GTTTTCTTTC T ATGACATTTTATTAGCTTT ATC GTGGAAGCGGCATC AATTTTCG 
GTAACTATTCCCCAAACCAGAATTTTTCAACAATTAAAAAAACTTTTTGTCTACGATTCTTO 
TAGCCATGATATTATCGAAACTTACTGCCAACTAAACTTTTCAGCAGGAGATTTGGACTACCTCTATTTAA 
' TTTATATCACCGCTAATAATTCTTTTGCGAGCTTACAATGGACACCTGAGCATATCAGACAATATTGTCAA 
CTTTTTGAAGAAAATGATACTTTTCGCCTGCTTTTAAATCCTATCATCAC^^ 
GCAAAAGGCTAGTTTAGTAAAAGCTCTTATGTTTTTTTCAAAATCATTCTT^ 
TTCCTGAGACCAACTTATTCGTTTCTCCGTACTATAAAGGAAACCAAAAACTCTATACGTC 
ATTGTCGAAGAGTGGATGGCCAAACTTCCTGGTAAGCGTGACTTGAACCATAAGCATTTTCATCTTTTTTG 
CCACTATGTCGAGCAAAGTCTAAGAAATATCCAACCTCCTTTAGTTGTTGTTTTCGTAGCCAGTAATTTTA 
TCAATGCTCATCTCCTAACGGATTCTTTTCCAAGGTATTTCTCGGATAAAAGCATTGATTTTCATTCCTAT 
TATCTATTGCAAGATAATGTTTATCAAATTCCTGATTTAAAGCCAGATTTGGTCATCACTCACAGTCAACT 
■ GATTC C TTTTGTTC AC C ATGAACTT AC AAAAGGAATTGC TGTTGCTGAAATATCTTTTGATGAATC GATTC 
TGTCTATCCAAGAATTGATGTATCAAGTTAAAGAGGAAAAATTCCAAGCTGATTTAACCAAGCAAT^ 
TAA 

SEQ ID NO: 72 amino acid sequence comprising GAS 372 

MIQIGKLFAGRYRILKSIGRGGMADVYIjAITOLI^ 

NIVAIRDIGEEDGQQFLVMEYVDGADLKRYIQNHAPLSNNEVVRIMEEVLSAMTLAHQKGIV^ 
LLTKEGWKVTDFGIAVAFAETSLTQTNSI^ 

GDSAWIALQHFQKPLPSIIEENHWPQALENWIRATAKKLSDRYGSTFEM 

FENVESTKPLPKVASGPTASVKIiSPPTPTVLTQESRLDQTNQTDALQPPTKKKKSGRFLGTLFKILFSFFI 
VGVALFTYLILTKPTSVKVPNVAGTSIiK^^ 

SSI TLYVS I GNKGFDMENYKGLDYQEAMNSL I ETYGVPKSKI KI ERIVTNEYPENTVI SQ S PSAGDKFNPN 
GKSKI TLSVAVSDTITMPMVTEYSYADAVNTLTALiGIDASRI KAYVPS S S S ATGFVP IH S P S SKAI VSGQS 
PYYGTSLSIiSDKGEISLYIiYPEETHSSSSSSSSTSSSNSSSINDSTAPGSNTELSPSETTSQTP 
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SEQ ID NO: 73 polynucleotide sequence encoding OAS 372 

ATGATTCAC^TTGGGAAATTATO 
GGATGTTTATTTAGCAAATGACTTGATCTTGGAT^ 

ATCAAACAGATCAGGTAGCAGTTGCGCGTTTCCAACGAGAAGCGCGGGCCATGGCTGAATT 
AATATTGTTGCCATCCGGGATATAGGTGAAGA^ 

TGCTGACCTAAAGAGATACATTCAAAATCATGCTCC^ 
AAGTCCTTTCTGCTATGACTTTAGCC^ 

CTACTAACTAAGGAGGGTGTTGTCAAAGTAACTGATTTCGGC^ 

GACACAAACTAATTCGATGTTAGGCAGTGTTCATTACTTGTCTCCAGAACAGGCTCGCGGCTC 
CGATTCAAAGTGATATTTATGCGATGGGGATTATGCTC 

GGCGATAGTGCTGTTACGATTGCCTTGCAACATTTTCAAAAGCCTCTTCCATCTATTATCGAGGAGAACCA 
CAATGTGCCACAAGCTTTGGAGAATGTTGTTATTCGAGCAACAGC^ 

CAACCTTTGAAATGAGTCGTGACTTAATGACGGCGCOTAGTTATAATCGTAGTCGGGAGCGTAAGATTATC 
TTTGAGAATGTTGAAAGTACCAAACCCCTCCC^ 

TCCCCCTACCCCAACAGTGTTAACACAGGAAAGTCGATTAGATCAAACTAATCAAACAGATGCTTTACAGC 

CCCCCACCAAAAAGAAAAAAAGTGGTCGTTTTTTAGGTACTTTATTCAAAATTC 

GTAGGTGTAGCACTCTTTACTTATCTTATA^ 

CACTAGTCOTAAAGTTGCCAAACAAGAACTGTATGATC^ 

AGAGTGATACGGTTGCTGAGGGAAATGTAGTTAGAACAGATCCTAAA 

TCAAGCATTACGCTTTATGTGTCAATTGGAAACAAAGGTTTTGACATGGAAAACTACAAAGGACTAGATT^A 
TCAAGAAGCTATGAATAGTTTGATAGAAACTTATGGTGTTCCAAAATCAAAAATCAAAATTGAGCGCATTG 
TAACTAATGAATATCCTGAAAATACAGTCATCAGTCAATCGCCAAGTGCGGGTGATAAATTTAATCCAAAC 
GGAAAGTCTAAAATTACGCTCAGTGTTGCTGTTAGTGATACGATCACTATGCCTATGGTAACAGAATATAG 
TTATGCAGATGCAGTCAATACCTTAACAGCTTTAGGTATAGATGCATCTAGAATAAAAGCTTATGTGCCAA 
GCTCTAGCTCAGCAACGGGCTTTGTGCCAATTCATTCTCCTAGTTCTAAAGCTATTGTCAGTGGTCAATCT 
CCTTACTATGGAACGTCTTTGAGTCTGTCTGATAAAGGAGAGATTAGTCTTTACCTTTATCCAGAAGAAAC 
ACACTCTTCTAGTAGCTCATCGAGTTCAACGTCAAGTTCAAACAGTTCTTCAATAAATGATAGTACTGCAC 
CAGGTAGCAACACTGAATTAAGCCCATCAGAAACTACTTCTCAAACACCTTAA 

SEQ ID NO: 74 amino acid sequence comprising GAS 39 

MDLXIjFIiLVTjVLLGLGAYIjL 
YQQLTDIRDVLHRSLSDSRDRSDKRLEKINQQ^ 

DSVS KQLESVNKGLGEMRSVAQDVGTLNKVIjSNTKTRGIIiGELiQIjGQI ied imts sqyerefvtvsg s ser 
VEYAIKLPGNGQGGYIYLPIDSKFPLEDYYRLEDAYEVGDKLAIEASRKAIiLAAIKRFA 
TTNFGVMFLPTEGLYSEVVRNASFFDSLRREENIWAGPSTIiSALLNS 
VKLEFDKFGGIiLAKAQKQMtTTANlTrLDQIjISTRTNAIVRALNTVETYQD 

SEQ ID NO: 75 polynucleotide sequence encoding GAS 39 

ATGGACCTTATCTTGTTCCTTTTGGTCTTGGTTCTCTTAGGTTTAGGGGCTTATCTGTTGTTCAAAGTCAA 

CGGCCTTCAACATCAGCTTGCCCAAACCCTAGAAGGCAACGCGGATAATTTGTCTGACCAAATGACCTACC 

AGTTGGATACAGCTAACAAACAACAATTGTTAGAGCTAACACAGCTGATGAACCGACAACAAGCAGGCCTT 

TACCAACAATTAACAGATATTCGTGACGTCTTGCACCGTAGTTTGTCTGATAGTAGGGACCGGTCTGACAA 

ACGCTTAGAAAAAATTAACCAGCAGGTCAACCAATCGCTCAAAAATATGCAAGAATCTAACGAAAAACGTT 

TGGAGAAAATGCGCCAGATCGTTGAAGAAAAATTGGAAGAAACCTTAAAAAATCGTCTGCACGCCTCTTTC 

GATTCTGTATCCAAGCAACTAGAAAGTGTCAATAAAGGCTTGGGAGAAATGCGTAGCGTGGCTCAAGATGT 
GGGTACTTTAAATAAGGTTTTGTCCAATACC^ 

TCATTGAGGATATCATGACATCAAGCCAGTACGAAAGAGAATTTGTAACGGTTAGTGGTTCTAGTGAACGC 

GTAGAATATGCGATTAAGCTCCCAGGAAATGGTCAAGGCGGTTAT^ 

CCCTCTTGAAGATTATTACCGATTAGAAGATGC 

GAAAAGCACTTCTGGCAGCTATCAAACGCTTTGCCAAAGACATTCATAAAAAGTACTTGAACCCCCCAGAG 
ACGACCAATTTCGGAGTTATGTTCTTACCAACAGAAGGTCTTO 

CTTTGATAGCCTTCGTCGGGAAGAAAATATTGTGGTTG 

CCTTATCTGTTGGTTTCAAGACCCTTAATATCCAAAAAAATGCTGATGACATCAGTAAAATTTTAGGCAAT 

GTCAAGTTAGAATTCGATAAATTTGGCGGCCTGCTTGCCAAGGCTCAAAAACAAATGAATACAGCTAATAA 
TACGCTGGATCAGCTCATTTCAACAAGGACAAATGCCAOT 

AAGACCAAGCAACAAAATCTCTCTTGAACATGCCCTTATTAGAAGAGGAAAATAATGAAAATTAA 
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« 

SEQ ID NO: 76 amino acid sequence comprising GAS 42 

MTKEKLVAFSQAHAEPAWLQERRLAALEAI PNLELPT I ERVKFHRWNLGDGTI/TENESLASVPDF I AXGDN 

PKLVQVGTQTVLEQIjPMALIDKGWFSDFYTALEEI pevi eahfgqalafdedklaayhtayfnsaavlyv 

PDHLEITTPIEAIFLQDSDSDVPFNKHVLVIAGK^ 

FSAIDRLGPSVTTYISRRGRLEKDANIDWAIJWMNEGNVIADFDSDLIGQGSQ 
TRVTTJYGQRTVGHILQHGVILERGTIiTFN^ 

TAGHAAS IGQVDPEDMYYIiMSRGLDQETAERLVIRGFLGAVTAEI P I P SVRQEI IKVLDEKLLNR 

SEQ ID NO: 77 polynucleotide sequence encoding GAS 42 

ATGACAAAAGAAAAACTAGTGGCTTTTTCGCAAGCCC^ 

AGCGGCATTAGAAGCCATTCCAAATTTGGAATTACCAACCATCGAAAGGGTTAAATTTCAC 
TAGGAGATGGTACCTTAACAGAAAATGAAAGTCTAGCTAGTGTTCCAGATTTTATAGCTATTGGAGATAAC 
CCAAAGCTTGTTCAGGTAGGCACGCAAACAGTCTTAGAACAGTTACCAATGGCGTTAATTGACAAGGGAGT 
TGTTTTCAGTGATTTTTATACGGCGCTTGAGGAAATCCCAGAAGTAATTGAAGCTC 

TAGCTTTTGATGAAGACAAACTAGCTGCCTACCACACTGCTTATTTTAATAGCGCAGCCGTGCTCTACGTT 
CCTGATCACTTGGAAATCACAACTCCTATTGAAGCTATTTTCTTACAAGATAGTGACAGTGACGTTCCT^ 
TAACAAGCATGTTCTAGTGATTGCAGGAAAAGAAAGTAAGTTCACCTATTTAGAGCGTTTTGAATCTATTG 
GCAATGCCACTCAAAAGATCAGCGCTAATATCAGTGTAGAAGT 

TTCTCGGCTATCGACCGCTTAGGTCCTTCAGTGACAACCTATATTAGCCGTCGAGGACGTTTAGAGAAGGA 

TGCCAACATTGATTGGGCCTTAGCTGTGATGAATGAAGGCAATGTCATTGCTGATTTTGACAGTGATTTGA 

TTGGTCAGGGCTCACAAGCTGATTTGAAAGTTGTTGCAGCCTCAAGTGGTCGTCAGGTACAAGGTATTGAC 

ACGCGCGTGACCAACTATGGTCAACGTACGGTCGGTCATATTTTACAGCATGGTGTGATTTO 

CACCTTAACGTTTAACGGGATTGGTCATATTCTAAAAGACGCTAAGGGAGCTGATGCTCAACAAGAAAGCC 

GTGTTTTGATGCTTTCTGACCAAGCAAGAGCCGATGCCAATCCAATCCTCTTAATTGATGAAAATGAAGTA 

ACAGCAGGTCATGCAGCTTCTATCGGTCAGGTTGACCCTGAAGATATGTATTACTTGATGAGTCGAGGACT 

GGATCAAGAAACAGCAGAACGATTGGTTATTAGAGGATTCCTAGGAGCGGTTATCGCTGAAATTCCTATTC 

C ATC AGTCC GC C AAGAGATTATTAAGGTTTTAG ATGAG AAATTGC TTAATC GTTAA 

SEQ ID NO: 78 amino acid sequence comprising GAS 58 

MKWSGFMKTKSKRFIjNIjATLCIiAIjLGTTLIiMAH PVQAEVI SKRDYMTRFGI1GDI1EDD SANYP SNLEARYKG 
YLEGYEKGLKGDDIPERPKIQVPEDVQPSDHGDYRDGYEEGFGEGQHKRDPLETEAEDDSQGGRQEGRQGH 
QEGADSSDLNVEESDGLSVIDEWGVIYQAFSTIWTYIiSGLF 

SEQ ID NO: 79 polynucleotide sequence encoding GAS 58 

ATGAAATGGAGTGGTTTTATGAAAACAAAATCAAAACGCTTTO 

ACTAGGAACAACTTTGCTAATGGCAC ATCCCGTACAGGCGGAGGTGATATCAAAAAGAGACTATATGACTC 
GCTTCGGGTTAGGCGATTTAGAAGATGATTCAGCTAACTATCCTTCAAATTTAGAAGCTAGATATAAAGGA 
TATTTAGAGGG AT ATGAAAAAGGC TTAAAAGGAGATGATATACCCGAACGGCCCAAGATTCAGGTTCCTGA 
GGATGTTCAGCCATCTGACCATGGCGACTATAGAGATGGTTATGAGGAAGGATTTGGAGAAGGACAACATA 
AACGTGATCCATTAGAAACAGAAGCAGAAGATGATTCTCAAGGAGGACGTCAAGAAGGACGTCAAGGACAT 
CAAGAAGGAGCAGATTCTAGTGATTTGAACGTTGAAGAAAGCGACGGTTTGTCTGTTATTGATGAAGTAGT 
TGGAGTAATTTATCAAGCATTTAGTACTATTTGGACATACTTAAGCGGTTTGTTCTAA 

SEQ ID NO: 80 amino acid sequence comprising N-terminal leader sequence of GAS 58 

MKWSGFMKTKSKRFLNLATLCIiALIX3TTLLMA 

SEQ ID NO: 81 amino acid sequence comprising a fragment of GAS 58 where the N-terminal 
leader sequence is removed 

HPVQAEVI SKRDYMTRFGLGDDEDDSANYPSNIjEARYKGYLEGYEKGLKGDDI PERPKIQVPEDVQPSDHG 

DYRDGYEEGFGEGQHKRDPLETEAEDDSQGGRQEGRQGHQEGADSSDIiNVEESDGIiSVIDEWGVIYQAFS 
TIWTYLSGLF 

SEQ ID NO: 82 amino acid sequence comprising GAS 290 



22/38 



WO 2005/032582 



PCT/US2004/024868 

• 



SEQUENCE LISTING 

MKHILFIVGSLREGSFNHQLAAQAQKA^ 
IFTPVYNFSIPGSVKJ^LDWLSRALDLSDPTC 
EFTKATVNPDAWGTGRLEI S KETKANLL S QAEALLiAAI 

SEQ ID NO: 83 polynucleotide sequence encoding GAS 290 

ATGAAACATATTTTATTTATTGTTGGCTCGCTTCGTGAAGGGTCTTTTAACCATCAATTAGCGGCTCAAGC 

ACAAAAAGCTCTGGAACATCAAGGAGTTGTATCTT^ 

ATATCGAAGCTAATGCACCTTTACCAGTTGTO 

ATOTTTACACCAGTTTACAACTTCTC 

TCTTG ATTTGTC TGATC C GACGGGC C C ATCTGCT ATTGGC GGTAAGGTGGTT ACGGTCTCTTC AGTTGC AA 
ATGGCGGGCATGATCAAGTATTTGATCAGTTTAAAGCACTATTGCCGTTTATCCGAACTTCAGTAGCAGGA 
GAGTTTACAAAAGCAACTGTGAATCCTGATGCCTGGGGAACAGGAAGGCTTGAGATTTCAAAAGAGACAAA 
AGCAAACTTGCTATCTCAGGCAGAGGCTCTTTTAGCGGCTATTTAG 

SEQ ID NO: 84 amino acid sequence comprising GAS 511 

*OTDVSRIIjKEARDQGRLTTIJ)YAN^ 
LARNFGQPNPEGYRKALRLMKQAEKFGRPVVTF^ 

I IGEGGSGGALAIiAVADQVV^ENTMYAVXiS PEGFAS ILWKDGSRATEAAELMKITAGEDYKMGI VDRI I P 
EHGYFSSE IVDI IKANLIEQITSLQAKPLDQLLDERYQRFRKY 

SEQ ID NO: 85 polynucleotide sequence encoding GAS 511 

ATGACAGATGT ATCAAGAATTTTAAAAGAAGC GCGTGATC AAGGGCGTTTAAC AACTTTGGATTAC GC CAA 
CCTTATTTTCGATGACTTTATGGAACTGCATGGCGATCGCCATTTTTCAGATGATGGTGCCATTGTAGGTG 
GCCTAGCTTATTTGGCGGGACAACCTGTTACGGTCATTGGTATTCAAAAAGGTAAGAATTTACAGGATAAT 
TTGGCAAGGAATTTTGGCCAGCCCAATCCAGAAGGTTATCGTAAAGCTTTGCGCCTTATGAAACAGGCAGA 
AAAATTTGGACGACCAGTTGTTACGTTTATCAATACTGCAGGAGCCTATCCAGGTGTCGGTGCGGAAGAAC 
GAGGACAGGGTGAGGCCATTGCTAAAAATTTGATGGAAATGAGTGATCTCAAGGTTCCCATTATCGCCATC 
ATTATTGGTGAAGGAGGCTCTGGTGGTGCATTAGCCTTAGCGGTTGCCGATCAGGTCTGGATGCTTGAAAA 
TACTATGTATGCGGTTCTTAGCCCAGAAGGCTTTGCTTCTATTTTATGGAAGGATGGTTCAAGGGCGACCG 
AGGCCGCTGAATTGATGAAAATCACAGCGGGTGAACTCTACAAAATGGGAATAGTAGACCGTATTATTCCA 
GAACATGGTTATTTTTCAAGTGAAATCGTTGACATCATCAAAGCTAACCTCATCGAACAAATAACCAGTTT 
GCAAGCTAAGCCATTAGACCAATTATTAGATGAGCGCTACCAACGCTTTCGTAAATATTAA 

SEQ ID NO: 86 amino acid sequence comprising GAS 533 

MAITVADIRREVKEKNVTFLRL^ 

YPDLDTWIVFPWGDENGAVAGLICDIYTAEGKPFAGDPRGOT 

DKGOTTLEVKTONGGYFDIjAPID^^ 

IFKLWKTIAREHGLYATFMAKPKFGIAGSGMHCl^ 

HAYim^AITNPTVNSYKRIiVPGYEAPVYVAWAGSimSPLIRVPASRGMGTRL 

EAGLDGI INKI EAPEFVEANI YTMTMEERNEAGI IDLPSTLHNALKALQKDDWQKALGYHIYTNFIjEAKR 
IEWSSYATFVSQWEIDHYIHNY 

SEQ ID NO: 87 polynucleotide sequence encoding GAS 533 

ATGGCAATAACAGTAGCTGACATTCGTCGTGAAGTCAAAGAAAAAAATGTAACGTTTCTTCGCTTGATGTT 
CACTGATATCATGGGCGTTATGAAAAATGTGGAGATTGCTGCAACTAAAGAACAGTTAGACAAAGTATTGT 
CTAACAAGGTTATGTTTGATGGTTCATCTATCGAAGGTTTTGTACGGATCAATGAGTCAGATATGTACCTT 
TACCCCGATTTAGACACTTGGATTGTTTTTC 

TGATATTTATACAGCAGAAGGAAAGCCTTTTGCAGGAGATCCTAGAGGAAATTTAAAAAGAGCCCTGAAAC 

ACATGAACGAGATCGGCTACAAATCATTTAATCTTGGAC 

GATAAAGGTAATCCGACACTTGAAGTTAACGATAATGGTGGTTATO 

AGACAACACGCGCCGTGAAATTGTGAATATTTTAACGAAAATGGGTTTTGAAGTGGAAGCTAGTCATCATG 
AAGTGGCTGTTGGTCAACATGAGATTGATTTTAAATATGCAGATGTTTTGAAAGCTTGTGATAATATTCAA 
ATTTTTAAGCTAGTTGTAAAAACGATTGCCCGTGAACATGGACTTTATGCTACTTTCATGGCTAAACCAAA 
ATTTGGAATAGCTGGATCAGGGATGCACTGTAACATGTCTTTGTTTGATAACCAAGGTAATAATGCTTTTT 
ATGATGAAGCTGATAAGCGAGGGATGCAGTTATCAGAAGATGCTTATTATTTCTTGGGAGGACTAATGAAG 
CATGCTTATAACTACACTGCTATCACTAACCCTACAGTGAATTCTTATAAACGATTAGTTCCAGGTTATGA 
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GGCACCTGTTTATGTCGCTTGGGCTGGAAGTAATCGTTCACCGCTTATC 
TGGGAAOGCGTTTGGAGTTACGTTCGGTTGATCCG 

GAAGCTGGATTAGATGGTATCATTAACAAAATTGAAGCTCCAGAACCCGTTGAAGCTAACATTTATACCAT 
GAC AATGGAAGAAC GAAATGAAGC AGGCATTATTGATTTGC CATC AACGCTTC ATAATGC CTTAAAAGCTC 
TTCAAAAAGATGATGTGGTACAAAAGGCACTAGGTTACCATATCTACACTAATTT 
ATTGAATGGTCTTCCTATGCAACTTTTGTTTCTC 

SEQ ID NO: 88 amino acid sequence comprising GAS 527 

MTEISILISTOVQKIIVLDYGSQYNQL^ 

FGIDPEIFELGI P IIjGICYGMQIjITHKIjGGKVVP AGQAGNREYGQSTLHIjRET SKLF SGTPQEQLjVLMSHG 
DAVTEI PEGFHLVGDSNDC PYAAI ENTEKNLYGI QFH PEVRHS VYGNDILKNF AI S ICGARGDWSMDNF ID 
MEIAKIRETVGDRKVIiLGLSGGXroSSWGVIiliQKAIGDQIjTCIFVD 

RVDASKRFLDLDADVEDPEKKRKI IGNEFVYVFDDEAS KLKGVDFLAQGTtiYTDI I E SGTETAQTI KSHHN 
VGGIiPEDMQFELIEPIiNTLFKDE\mAIiGIALGMPEEIVWRQPFPGPGLAI 

REE I AKAGIjDRDVWQYFTVNTGVRS VGVMGDGRTYDYT I AIRAIT S IDGMTADFAQIiPWDVLKKI STRI VN 
EVDHVNRIVYDITSKPPATVEWE 

« 

SEQ ID NO: 89 polynucleotide sequence encoding GAS 527 

ATGACTGAAATTTCAATTTTGAATGATGTTCAAAAAATTATCGTTCTTGATTATGGTAGCCAGTACAATCA 

GCTTATTGCTAGACGTATTCGAGAGTTTGGTGTTTTCTCCGAACTAAAAAGCCATAAAATCACCGCTCAAG 

AACTTCGTGAGATCAATCCCATAGGTATCGTTTTATCAGGAGGGCCTAACTCTGTTTACGCTGATAACGCC 

TTTGGCATTGACCCTGAAATCTTTGAACTAGGGATTCCGATTCTTGGTATCTGTTACGGTATGCAATTAA^ 

CACCCATAT^ATTAGGTGGTAAAGTTGTTCCTGCTGGACAAGCTGGTAATCGTGAATACGGTCAGTCAACCC 

TTCATCTTCGTGAAACGTCAAAATTATTTTCAGGCACACCTCAAGAACAACTCGTTTTGATGAGCCATGGT 

GATGCTGTTACTGAAATTCCAGAAGGTTTCCACCTTGTTGGAGACTCAAATGACTGTCCCTATGCAGCTAT 

TGAAAATACTGAGAAAAACCTTTACGGTATTCAGTTCCACCCAGAAGTGAGACACTCTGTTTATGGAAATG 

ACATTCTTAAAAACTTTGCTATATCAATTTGTGGCGCGCGTGGTGATTGGTCAATGGATAATTTTATTGAC 

ATGGAAATTGCTAAAATTCGTGAAACTGTAGGCGATCGTAAAGTTCTTCTAGGTCTTTCTGGTGGAGTTGA 

TTCTTC AGTTGTTGGTGTTCTACTTC AAAAAGCTATCGGTGAC C AATTAACTTGTATTTTC GTTGATC ACG 

GTCTTCTTCGTAAAGACGAGGGCGATCAAGTTATGGGAATGCTTGGGGGCAAATTTGGCCTAAATATTATC 

CGTGTGGATGCTTCAAAACGTTTCTTAGACCTTCTTGCAGACGTTGAAGATCCTGAGAAAAAACGTAAAAT 

TATTGGTAATGAATTTGTC TATGTTTTTGATGATGAAGCCAGC AAATTAAAAGGTGTTGACTTCCTTGC CC 

AAGGAACACTTTATACTGATATCATTGAGTCAGGAACAGAAACTGCTCAAACCATCAAATCACATCACAAT 

GTGGGTGGTCTCCCCGAAGACATGCAGTTTGAATTGATTGAGCCCTTAAACACTCTTTTCAAAGATGAAGT 

TCGAGCGCTTGGAATCGCTCTTGGAATGCCTGAAGAAATTGTTTGGCGCCAACCATTTCCAGGTCCTGGAC 

TTGCTATCCGTGTCATGGGAGCAATTACTGAAGAAAAACTTGAAACCGTTCGCGAATCAGACGCTATCCTT 

CGTGAAGAAATTGCTAAGGCTGGACTTGATCGTGACGTGTGGCAATACTTTACAGTTAACACAGGTGTCCG 

TTCTGTAGGCGTCATGGGAGATGGTCGTACTTATGATTATACCATCGCCATTCGTGCTATTACGTCTATTG 

ATGGTATGACAGCTGACTTTGCTC AACTTCCTTGGGATGTCTTGAAAAAAATCTC AAC AC GTATC GTAAAT 

GAAGTTGACCACGTTAACCGTATCGTCTACGACATCACAAGTAAACCACCCGCAACAGTTGAATGGGAATA 

A 

SEQ ID NO: 90 amino acid sequence comprising GAS 294 

MSQSTATYI WIGAGLAGSEAAYQIAKRGI PVKLYEM^^ 

. LKEEMRRLDS I IMRNGEANRVPAGGAMAVDREGYAESVTAELENHPIjIEVIRGEITEI PDDAITVI ATGPL 
TSDAIAEKIHAIiNGGDGFYFYDAAAPIIDKSTIDMSKVYDKSRYDKGEAAYLN 

EEAPLNAFEKEKYFEGCMPI EVMAKRGI KTMLYGPMKPVGLEYPDDYTGPRDGEFKTP YAVVQLRQDNAAG 

SLYNIVGFQTHLKWGEQKRWQMIPGI/ENAEFVRYGVM^ 

EGYVESAASGLVAGINAARLFKREEALIFPQTTAIG 

KERYEAIASRALADLDTCLASL 

SEQ ID NO: 91 polynucleotide sequence encoding GAS 294 

TTGTCTCAATCAACTGCAACTTATATTAATGTTATTGGAGCTGGGCTAGCTGGTTCTGAAGCTGCCTATCA 
GATTGCTAAGCGCGGTATCCCCGTTAAATTGTATGAAATGCGTGGTGTCA7VAGCAACACCGCAACATAAAA 
CCACTAATTTTGCCGAATTGGTCTGTTCCAACTCATOT 

CTCAAAGAAGAAATGCGGCGATTAGACTCC^TTATTATGCGTAATGGTGAAGCTAACCGCGTACCTGCTGG 
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GGGAGCAATGGCTGTTGACCGTGAGGGGTATC 

TTGAGGTC ATTC GTGGTGAAATTACAGAAATCCCTGAC GATGCTATCAC GGTTATCGC GACGGGACC GCTG 
ACTTCGGATGCCCTGGCAGAAAAAATTCACGCGCTAAATGGTGGC GAC GGATTCTATTTTTACGATGC AGC 
AGCGCCTATCATTGATAAATCTACCATTGATATGAGCAAGGTTTACCTTAAATC 

AAGC TGCTTACCTCAAC TGCCCTATGACCAAAGAAGAATTC ATGGC TTTC C ATGAAGCTC TGACAACC GC A 
GAAGAAGC C C C GCTGAATGC CTTTGAAAAAGAAAAGTATTTTGAAGGCTGTATGCCGATTGAAGTTATC 

TAAACGTGGCATTAAAACCATGCTTTATGGACCTATGAT^CCCGTTGGATTGGAATATCCAGATGACTATA 

CAGGTCCTCGCGATGGAGAATTTAAAACGCCATATGCC^ 

AGCCTTTATAATATCGTTGGTOTCCAAACCCA 

TCCAGGGCTTGAAAATGCTGAGTTTGTCCGCTACGGCGTCATGCATCGCAATTCCTATATGGATTCACCAA 
ATCTTTTAACCGAAACCTTCCAATCTCGGAGCAATCCAAACCTTTTCTTTG^ 

GAAGGTTATGTCG2^TCAGCTGCTTCAGGTTTAGTAGCAGGAATCAATGCTGCTCGTTTGTTCAAAAGAGA 
AGAAGCACTTATTTTTCCTCAGACAACAGCTATTGGGAGTTTGCCTCATTATGTGACTCATGCCGACAGTA 
AGCATTTCCAACCAATGAACGTCAACTTTGGGATCATCAAAGAGTTAGAAGGCCCACGCATTCGTGACA 
AAAGAACGTTATGAAGCTATTGCTAGTCGTGCTTTGGCAG 

SEQ ED NO: 92 amino acid sequence comprising GAS 253 

MPKKILFTGGGTVGHVTIiNLIIiI PKFIKDGWEVHY IGDKNGI EHTEIEKSGLDVTFHAI ATGKLRRYFSWQ 
NIjADVFKVALGLIiQSLFIVAKLRPQALF SKGGFVSVP PVVAAKIjLGKPVF IHE SDRSMGLANKI AYKFATT 
MYTTFEQEDQLS KVKHLGAVTKVFKDANQMPE 
LKQRYNIINITGDPHLNELSSHIiYRVDYV^ 

SRGDQLENATYFEKRGYAKQLQEPDLTLHNFDQAMADLFEHQADYEATMLATKE I Q SPDFF YDIiIiRADI S S 
AIKEK 

SEQ ID NO: 93 polynucleotide sequence encoding GAS 253 

ATGCCTAAGAAGATTTTATTTACAGGTGGTGGAACTGTAGGTCATGTCACCTTGAACCTCATTCTCATACC 
AAAATTTATCAAGGACGGTTGGGAAGTACATTATATTGGTGATAAAAATGGCATTGAACATACAGAAATTG 
AAAAGTCAGGCCTTGACGTGACCTTTCATGCTATCGCGACAGGCAAGCTTAGACGCTATTTTTCATGGCAA 
AATCTAGCTGATGTTTTTAAGGTTGCACTTGGCCTCCTACAGTCTCTCTT^ 

TCAAGCCCTTTTTTCCAAAGGTGGTTTTGTCTCAGTACCGCCAGTTGTGGCTGCTAAATTGCTTGGTAAAC 
CAGTCTTTATTCATGAATCAGATCGGTCAATGGGACTAGCAAACAAGATTGCCTACAAATTTGCAACTACC 
ATGTATACCACTTTTGAGCAGGAAGACCAGTTGTCTAAAGTTAAACACCTTGGAGCGGTGACAAAGGTTTT 
CAAAGATGCCAACCAAATGCCTGAATCAACTCAGTTAGAGGCGGTGAAAGAGTATTTTAGTAGAGACCTAA 
AAACCCTCTTGTTTATTGGTGGTTCGGCAGGGGCGCATGTGTTTAATCAGTTTATTAGTGATCATCCAGAA 
TTGAAGCAACGTTATAATATCATCAATATTACAGGAGACCCTCACCTTAATGAATTGAGTTCTCATCTGTA 
TCGAGTAGATTATGTTACCGATCTCTACCAACCTTTGATGGCGATGGCTGACCTTGTAGTGACAAGAGGGG 
GCTCTAATACACTTTTTGAGCTACTGGCAATGGCTAAGCTACACCTCATCGTTCCTCTTGGTAAAGAAGCT 
AGCCGTGGQGATCAGTTAGAAAATGCCACTTATTTTGAGAAGAGGGGCTACGCTAAACAATTACAGGAACC 
TGATTTAACTTTGCATAATTTTGATCAGGCAATGGCTG 

CTATGTTGGCAACTAAGGAGATTCAGTCACCGGACTTCTTTTATGACCTTTTGAGAGCTGATATTAGCTCC 
GCGATTAAGGAGAAGTAA 

SEQ ID NO: 94 amino acid sequence comprising GAS 529 

MCGIVGWGNRNATDILMQGLEKLEYRGTO 
RWATHGQSTEDNAHPHTSQTGRFVIiVH^ 

LEAFKKSL S 1 1 EGS YAFALMDS QATDTI YVAKNKS PLLIGLGEGYNMVC SDAMAMI RETS EFME IHDKELV 
ILTKDKVTVTDYDGKELIRDSYTAEIjDDSDIGK^ 

ITSIQEADRIiYILAAGTSYHAGFATKlSnyU^EQLTOTPVELGVASEWGYHMP 

QVLVKANAMGI PSLTVTNVPG STLSREATYTMLI HAGPEIAVASTKAYTAQ I AALAFLAKAVGEANGKQEA 
LDFNLVH EL S L VAQS I EATL S EKDLiVAEKVQALrLATTRNAFY I GRGND YYVAMEAALKLKE I S Y I QC EGF A 

AGELKHGTISLIEEDTPVIALI SSSQLVASHTRGNIQEVAARGAHVIjTV\^EGIjDREGDDI XVNKVHPFLA 
PI AMVI PTQLI AYYASLQRGIiDVDKPRNLAKAVTVE 

SEQ ID NO: 95 polynucleotide sequence encoding GAS 529 

ATGTGTGGAATTGTTGGAGTTGTTGGAAATCGCAATG 

TGAATACCGGGGTTATGATTCAGCAGGAATTTTTGTGGCTl^TGCCAATCAAACAAACTTGATTAAATCAG 
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TGGGGCGGATTGCTGATTTGCGTGCCAAGATTGGCATTG^ 

CGTTGGGCAACGCATGGCCAATGAACAG 

ACTTGTTCATAATGGTGTGATTGAAAATTACC 

AGGGGCAGACAGATACTGAGATTGCAGTACACTTGATTGGAAAATTTC 

CTGGAAGCTTTTAAAAAATCTTTAAGCATTATTGAAGGTTCCTACGCCTTTGCATTAATGGA 

AAC TGATACTATTTATGTGGCTAAAAACAAGTCTC CATTGTTGATTGGACTTGGTGAAGGTOAC AAC ATGG 

TTTGTTCAGATGCCATGGCCATGATTCGTGAAACCAGTGAATTTATGGAAATO 

ATTTTAACCAAAGATAAGGTAACTGTTACAGACTACGATGGTAAAGAGeTGATACGAGATTC 

TGAATTAGACTTATCTGATATTGGCAAAGGGAC 

CAACCGTAATGCGTCAATTAATTTCAACTTATGCAGATGAAACTGGTAACGTACAGGTO 
ATTACCTCTATCCAAGAGGCTGACCGTCTTTATATTTTAGCGGCAGGGACTTC 

AACAAAAAATATGCTTGAGCAATTGACAGATAC^CCAGTTGAGTTGGGCGTGGCTTCTGAGTGGGGTO 

ACATGCCTCTGCTTAGCAAGAAACCAATGTTTATTCTACTAAGCCAATCAGGAGAAACCGCAGATAGTC 

CAAGTT'TOAGTAAAGGCAAATGCTATGGGCATTCCGAGTTT 

ATCACGTGAAGCAACATACACCATGTTGATTCATGCTGGACCTGAAATTGCTGTTGCGTCTACAAAAGCTT 
ACACTGCACAAATTGCTGCCCTTGCCTTTTTGGCTAAGGCAGTTGGTGAGGCAAATGGTAAGCAAGAAGCT 
CTTGACTTTAACTTGGTACATGAGTTGTCATO^ 

.TCTCGTGGCAGAAAAGGTTCAAGCTTTGCTAGCTACTACTCGTAATGCTTTTTACATCGGGCGTGGCAATG 
ATTATTACGTTGCGATGGAAGCTGCTTTGAAATTAAAAGAGATTTCTTATATTCAATGCGAAGGCTTTGCG 
GCTGGTGAATTGAAACATGGAACCATTTCATTAATTGAGGAGGACACGCCAGTAATCGCTTTAATAT 
TAGTCAGTTGGOTGCCTCTCATACGCGTGGTAATATTCAAGAAGTTGCTGCCCGTGGGGCTCATGTTOT 
CAGTTGTGGAAGAAGGGCTTGACCGTGAGGGAGATGACATTATTGTCAATAAGGTTGATCCTTTCCTAGCC 
CCGATTGCTATGGTCATTCCAACTCAACTGATTGCTTACTACGCTTCATTACAACGTGGACTTG 
TAAGCCACGTAATTTGGCTAAAGCTGTAACAGTAGAATAA 

SEQ ID NO: 96 amino acid sequence comprising GAS 45 

VTFMKKSKWIjAAVSVAILSVSALAA CGNKNASGGS 

GLLENDEYGNIjVP SLAKDWKVS KDGLTYTYTIiRDGVSWYTADGEEYAPVTAEDFVTGLKHAVDDKSDALYV 
VEDS IKNIiKAYQNGEVDFKEVGVKALOT^ 

jSSILVWGAYFLSAFTSKSSMEFHKNENYVTOAKNVGIESVKLTYSDGSDPGSFYKNFD 
TYKSAKKITCADNITYGMLTGDIRHIj^^ 
QTAGQDAKTKALRNMLVPPTFVTIGESDFG 
ALTAEGVTFPVQkDYPVDQANAATVQEAQSFKQSV^ 

DI I S SWWGPDYQDPRTYLDIMS PVGGGSVIQKLG I KAGQNKDWAAAGIiDTYQTIjLDEAAAITDDNDARYK 
AYAKAQ AYIiTDNAVDI PWALGGTPRVTKAVPFSGGF SWAGSKGPLAYKGMKLQDKPVTVKQYEKAKEKWM 
KAKAKSNAKYAEKLADHVEK 

SEQ ID NO: 97 polynucleotide sequence encoding GAS 45 

GTGACTTTTATGAAGAAAAGTAAATGGTTGGCAGCTGTAAGTGTTGCGATCTTGTCAGTATCCGCTTTGGC 
AGCTTGTGGTAATAAAAATGCTTCAGGTGGCTCAGAAGCTACAAAAACCTACAAGTACGTTTTTGTTAACG 
ATCCAAAATCATTGGATTATATTTTGACTAATGGCGGTGGAACGACTGATGTGATAACACAAATGGTTGAT 
GGTCTTTTGGAAAACGATGAGTATGGTAATTTAGTAC^ 

CGGTCTGACTTATACTTATACTCTTCGCGATGGTGTCTCTTGGTATACGGCTGATGGTGAAGAATATGCCC 
CAGTAACAGCAGAAGATTTTGTGACTGGTTTGAAGCACGCGGTTGACGATAAATCAGATGCTCTTTACGTT 
GTTGAAGATTCAATAAAAAACTTAAAGGCTTAC^ 

AGCCCTTGACGATAAAACTGTTCAGTATACTTTGAACAAGCCTGAAAGCTACTGGAATTCAAAAACAACTT 
ATAGTGTGCTTTTCCCAGTTAATGCGAAATTTTTGAAGTCAAAAGGTAAAGATTTTGGTACAACCGATCCA 
TCATCAATCCTTGTTAATGGTGCTTACTTCTTGAGC^ 
AAATGAAAACTACTGGGATGCTAAGAATGTTGGGATAGAATCTGTTA 

ACCCAGGTTCGTTCTACAAG7VACTTTGAGAAGGGTGAGTTCAGCGTTGCACGACTTTACCCAAATGACCCT 

ACCTACAAATCAGCTAAGAAAAACTATGCTGATAACATTACTTACGGAATGTTGACTGGAGATATCCGTCA 

TTTAACATGGAATTTGAACCGTACTTCTTTCAAAAACACTAAGAAAGACCCTGCACAACAAGATGCCGGTA 

AGAAAGCTCTTAACAACAAGGATTTTCGTCAAGCTATTCAGTTTGCTTTTGACCGAGCGTCATT 

CAAACTGCAGGTCAAGATGCCAAAACAAAAGCCTTACGTAACATGCTTGTCCCACCAACATTTGTGACCAT. 

TGGAGAAAGTGATTTTGGTTCAGAAGTTGAAAAGGAAATGGCAAAACTTGGTGATGAATGGAAAGACGTTA 

ACTTAGCTGATGCTCAAGATGGTTTCTATAATCCTGAAAAAGCAAAAGCTGAGTTTGCAAAAGCCAAAGAA 
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GCTTTAACAGCTGAAGGTGTAACCTTCCCAGTTCAATTAGATTACCCTGTTGACCAAGCAAACGCA(k;AAC 
TGTTCAGGAAGCCCAGTCTTTCAAACAATCTGTTGAAGC^TC^ 

TTCTTCAAACAGAAACATC^CTCACGAAGCCCAAGGCTTCTATGCTGAGACCCCAGAACAA 

GATATCATTTCATCATGGTGGGGACCAGACTATCAAGATCCACGGACCTACCTTGACATCATGAGTCCAGT 

AGGTGGTGGATCTGTTATCCAAAAACTTGGAATCAAAGCAGGTCAAAATAAGGATGTTGTGGCAGCTGCAG 

GCCTTGATACCTACCAAACTCTTCTTGATGAAGCAGCAGCAATTACAGACGACAACGATGCGCGCTATAAA 

GCTTACGCAAAAGCACAAGCCTACCTTACAGATAATGCCGTAGATATTCCAGTTGTGGCATTGGGTGGCAC 

TCCACGAGTTACTAAAGCCGTTCCATTTAGCGGGGGCTTCTCTTGGGCAGGGTCTAAAGGTCCTCTAGCAT 

ATAAAGGAATGAAACTTCAAGACAAACCTGTCACAGTAAAACAATACGAAAAAGCAAAAGAAAAATGGATG 

AAAGCAAAGGCTAAGTCAAATGCAAAATATGCTGAGAAGTTAGCTGATCACGTTGAAAAA 

SEQ ID NO: 98 amino acid sequence comprising an N-terminal leader sequence of GAS 45 

VTFMKKSKWIiAAVSVAILSVSALAA 

SEQ ID NO: 99 amino acid sequence comprising a fragment of GAS 45 where the N-terminal 
leader sequence is removed 

CGNKNASGGS EATKTYKYVFVNDPKS IjDYILTNGGGTTDVITQMVDGLLENDEYGNLVPSLAKDWKVSKDG 
LTYTYTLRDGVSWYTAD^ 

LDDKTVQYTLNKPE SYWNSKTTYSVIjF PVNAKFLKSKGKDFGTTDP SS ILVNGAYFLS AFTSKS SMEFHKN 
ENYWDAKWGIESVKLTYSTCSDPGSFYK^DKGEFSVARLYPlSro 

TWNLNRTSFKNTKKDPAQQDAGKKALNNKDFRQAIQFAFDRASFQAQ ' 
ESDFGSEVEKEMAKLGDEWKDVNLADAQDGFYNPEKAKAEFA^^ 

QEAQSFKQS VEASIiGKENVIVNVIjETETSTHEAQGFYAETPEQQDYDI I S SWWGPDYQDPRTYLDIMS PVG 
GGSVI QKLGI KAGQNKDVVAAAGLDTYQTLLDEAAAITDDITOARYKAYAKAQAYIjTDNAVDI PWALGGTP 
RVTKAVPFSGGFSWAGSKGPLAYKGM^ 

SEQ ID NO: 100 amino acid sequence comprising GAS 95 

MKIGKKIVLMFTAIVLTTV LALGVYM 

SERASKWEGNSDSMI LVTVNPKTKKTTMTSLERDTIiTTLS GPKNNEMNGVEAKLiNAAYAAGGAQMAIMTVQ 
DLLNITIDNYVQINMQGZjIDLVNAVGGITVTNEFDFPI S IAENEPEYQATVAPGTHKINGEQALVYARMRY 
DDPEGDYGRQKRQREVI QKVLKKILALDSI SSYRKIIiSAVSSNMQTNIEI SSRTI PSLLGYRDALRTIKTY 
QLKGEDATLSDGGS YQ IVTSNHLLEI QNRIRTELGLHKVNQLKTNATVYElsnijYGSTKS QTVNNNYDS SGQA 
PSYSDSHSSYANYSSGVDTGQSASTDQDSTASSHRPATPSSSSDALAADESSSSGSGSLVPPANINPQT 

SEQ ID NO: 101 polynucleotide sequence encoding GAS 95 

ATGAAAATTGGAAAAAAAATAGTTTTAATGTTCACAGCTATTGTGTTAACAACTGTCTTG GCATTAGGTGT 
CTA T C TAACTAGTGCTTATACCTTCTCAA CAGGAGAATT ATP. A A AeiAOPT^fpa a Ari^avprprprport^ ^ ^rppi^ 

CAAACAAAAGTGATGCCATTAAACAAACAAGAGCTTTTTCTATCTTGTTGATGGGTGTTGATACAGGCTCT 
TCAGAGCGTGCCTCCAAGTGGGAAGGAAACAGTGATTCGATGATTTTGGTTACGGTTAATCCAAAGACCAA 
GAAAACAACTATGACTAGTTTAGAACGAGATACCTTAACCACGTTATCTGGACCCAAAAATAATGAAATGA 
ATGGTGTTGAAGCTAAGCTTAACGCTGCTTATGCAGCAGGTGGCGCTCAGATGGCTATTATGACCGTGCAA 
GATCTTTTGAATATCACCATTGATAACTATGTTCAAATTAATATGCAAGGCCTTATTGATCTTGTGAATGC 
AGTTGGAGGGATTACAGTTACAAATGAGTTTGATTTTCCTATCTCGATTGCTGAAAA 

AAGCTACTGTTGCGCCTGGAACACACAAAATTAACGGTGAACAAGCTTTGGTTTATGCTCGTATGCGTTAT 

GATGATCCTGAGGGAGATTATGGTCGACAAAAGCGTCAACGTGAAGTCATTCAAAAGGTATTGAAAAAAAT 

CCTTGCTCTTGATAGCATTAGCTCTTATCGGAAGATTTTATCTGCTGTAAGTAGTAATATGCAAACGAATA 

TCGAAATCTCTTCTCGCACTATCCCTAGTCTATTAGGTTATCGTGACGCACTTAGAACTATTAAGACTTAT 

CAACTAAAAGGAGAAGATGCCACTTTATCAGATGGTGGATCATACCAAATTGTTACCTCTAATCATTTGTT 

AGAAATCCAAAATCGTATCCGAACAGAATTAGGACTTGATAAGGTTAATCAATTAAAAACAAA^ 

TTTATGAAAATTTGTATGGGTCAACTAAGTCTCAGACAGTAAACAACAACTATGACTCTTCAGGCCAGGCT 

CCATCTTATTCTGATAGTCATAGCTCTTACGCTAATTATTCAAGTGGAGTAGATACCGGCCAGAGTGCTAG 

TACAGACCAGGACTCTACTGCTTCAAGCCATAGGCCAGCTACGCCGTCTTCTTCATCAGATGCTTTAGCAG 

CTGATGAGTCTAGCTCATCAGGGTCTGGATCATTAGTTCCTCCTGCTAATATCAACCCTCAGACCTAA 

SEQ ID NO: 102 amino acid sequence comprising N-terminal leader sequence of GAS 95 

MKI GKKIVLMFTAI VLTTVLALGVYIjTS AYTFS 
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SEQ ID NO: 103 amino acid sequence comprising a fragment of GAS 95 where the N^erminal 
leader sequence is removed. 

TGELSKTFKDFSTSSITCSDAIKQTRAFSILLMGVOT^ 

DTLTTLSGPKNNEMNGVEAKLNAAYA^^ 

EDFPI S3^ENEPEYQATVAPGTHKINGEQALVYARiraYro^ 

RKILSAVSSNMQTNIEISSRTIPSLLGYRDALRTIKTYQLKGEDATIiSDGGSYQrVTSNHIiLEIQNRIRTE 

LGLHKVNQLKTNATVYENIiYGSTKSQTVNNJTroSSGQAPSYSDSHSSYAOT 

HRPATPS S SSDAIiAADESSS SGSGSLVPPAN1NPQT 

SEQ ID NO: 104 amino acid sequence comprising GAS 193 

MKKRKLLAVTLLSTILLNSAVPLVVADTSLRNSTSSTDQPTTADTD 

DHKPSHTHPTPPSNDTKQTDQASSEATDKPNKDKNETKQPDSSDQSTPSPKDQSSQKESQNKDGRPTPSPD 

QQKDQTPDKTPEKSADKTPEKGPEKATDKTPEPNRDAPKPIQPPLAAAPVPIPWRESDKDLSKLKPSSRSS 

AAYVRHWTGDSAYTHNLLSRRYGITAEQLDGFLNSLGIHYDKERLNGKRLLEWEKLTGLDVRArVAIA^ 

SSLGTQGVAKEKGAlMFGYGAFDFNPNNAKKYSDEVAIRHMVEDTIIAtnCNQTFERQDLRAKKWSLGQIjDT 

LIDGGVYFTDTSGSGQRRADIMTKLDQWIDDHGSTPEIPEHLKITSGTQFSEVPVGYKRSQPQNVLTYKSE 

TYSFGQCTWYAYNRVKELGYQVDRYMGNGGDWQRKPGFVTTHKPKVGYVVSFAPGQAGADATYGHVAVVEO 
IKEDGSILISESNVMGLGTISYRTFTAEQASLLTYWGDKLPRP 

SEQ ID NO: 105 polynucleotide sequence encoding GAS 193 

ATGAAGAAAAGGAAATTGTTAGCAGTAACACTATTAAGTACCATACTCTTAAACAGTGCAGTGCCATTAGT 
TGTTGCTGATACCTCCTTGCGTAATAGCACATCATCCACTGATCAGCCTACTACAGCAGATACTGATACGG 
ATGACGAGAGTGAAACACCAAAAAAAGACAAAAAAAGCAAGGAAACAGCGTCGCAGCACGACACCCAAAAA 
GACCATAAGCCATCACACACTCACCCAACCCCCCCTTCAAATGATACTAAGCAGACCGATCAGGCATCATC 
TGAAGCTACTGACAAACCAAATAAAGACAAAAACGACACCAAGCAACCAGACAGCAGTGATCAATCCACCC 
CATCTCCCAAAGACCAGTCGTCTCAAAAAGAGTCACAAAACAAAGACGGCCGACCTACCCCATCACCTGAT 
CAGCAAAAAGATCAGACACCTGATAAAACACCAGAAAAATCAGCTGATAAAACCCCTGAAAAAGGACCAGA 
AAAAGCAACTGATAAAACACCAGAGCCAAATCGTGACGCTCCAAAACCCATCCAACCTCCTTTAGCAGCTG 
CTCCTGTCTTTATACCTTGGAGAGAAAGTGACAAAGACCTGAGCAAGCTAAAACCAAGCAGTCGCTCATCA 
GCGGCTTACGTGAGACACTGGACAGGTGACTCTGCCTACACTCACAACCTGTTGTCACGCCGTTATGGGAT 
TACTGCTGAACAGCTAGATCGTTTTTTGAACAGTCTAGGTATTCACTATGATAAAGAACGCTTAAACGGAA 
AGCGTTTATTAGAATGGGAAAAACTAACAGGACTAGACGTTCGAGCTATCGTAGCTATTGCAATGGCAGAA 
AGCTCACTAGGTACTCAGGGAGTTGCTAAAGAAAAAGGAGCCAATATGTTTGGTTATGGCGCCTTTGACTT 
CAACCCAAACAATGCCAAAAAATACAGCGATGAGGTTGCTATTCGTCACATGGTAGAAGACACCATCATTG 
CCAACAAAAACCAAACCTTTGAAAGACAAGACCTCAAAGCAAAAAAATGGTGACTAGGCCAGTTGGATACC 
TTGATTGATGGTGGGGTTTACTTTACAGATACAAGTGGCAGTGGGCAAAGACGAGCAGATATCATGACCAA 
ACTAGACCAATGGATAGATGATCATGGAAGCACACCTGAGATTCCAGAACATCTCAAGATAACTTCCGGGA 
CACAATTTAGCGAAGTGCCCGTAGGTTATAAAAGAAGTCAGCC^CAAAACGTTTTGACCTACAAGTCAGAG 
ACCTACAGCTTTGGCCAATGCACTTGGTACGCCTATAATCGTGTCAAAGAGCTAGGTTATCAAGTCGACAG 
GTACATGGGTAACGGTGGCGACTGGCAGCGCAAGCCAGGTTTTGTGACCACCCATAAACCTAAAGTGGGCT 
ATGTCGTCTCATTTGCACCAGGCCAAGCAGGAGCAGATGCAACCTATGGTCACGTTGCTGTTGTAGAGCAA 
ATCAAAGAAGATGGTTCTATCTTAATTTCAGAGTCAAATGTTATGGGACTAGGCACCATTTCCTATCGGAC 
GTTCACAGCTGAGCAGGCTAGTTTGTTGACCTATGTCGTAGGGGACAAACTCCCAAGACCATAA 

SEQ ID NO: 106 amino acid sequence comprising GAS 137 

MSDKHIOT.Vr^GMSGAGKTVAIQSFEDLGYFTIDNMPPALVPKFLELIEQTNENRRVALVVDMRSRLFFK 

EINSTLDSIESNPSIDFRILFLDATDGELVSRYKETRRSHPLAADGRVLDGIRLERELLSPLKSMSQHWD 

TTKLTPRQDRKTISDQFSEGSNQASFRIEVMSFGFKYGLPIjDADLVFDVRFI,PNPYYQVELREKTGLDEDV 

FNYVMSHPESE^YKHLLNLIVPILPAYQKEGKSVLTVAIGCTGGQHRSVAFAHCLAESLATDWSVNESHR 
DQNRRKETVNRS 
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GAAATTAATTC TACCTTAGATAGTATTGAAAGC AATC CTAGC ATTGATTTTCGGATTC TTTTTTTGGATGC 
AAC GGATGGAGAATTGGTGTC AC GCTATAAAGAAACCAGAC GGAGCC ACCCTTTGGCTGC GGACGGTCGTG 
TGCTTGATGGTATTC GATTGGAAAGAGAACTC CTATCTC CTTTGAAAAGCATGAGCC AAC ATGTGGTGGAT 
ACAACAAAATTGACCCCTAGACAATTGCGTAAAACCATTTCAGAC^ 

CTCTTTCCGTATTGAAGTGATGAGCTTTGGGTTCAAATATGGTCTTCCTOT ■ 

ATGTGCGTTTTCTACCCAATCCTTATTATCAGGTAGAGCTTCGTGAAAAAACAGGACTAGATGAGGACGTT 

TTTAATTATGTGATGTCTCACCCAGAATCAGAGGTC^ 

CTTACCGGCTTACCAAAAAGAAGGGAAGTCTGTCTTG 

GCAGCGTTGCCTTTGCCCATTGCTTGGCAGAAAGTC 

GATCAAAATCGTCGTAAGGAAACGGTGAATCGTTCATGA 

SEQ ID NO: 108 amino acid sequence comprising GAS 84 

MI I KKRTVAILAIASSFFLVAC QATKSLKSGDAWGVYQKQKS ITVGFDNTFVPMGYKDESGRCKGFDIDLA 
KEVFHQYGIjKVNFQAINVJDMKEAEIiNNGKIDVIWNGYS ITKERQDKVAFTDS YMRNEQI IWKKRSDIKTI 
• SDMKHKVLGAQSASSGYDSLLRTPKLLKDFIKim)^^ 

GQLENYRMI PTTF ENEAF SVGIjRKEDKTIjQAKINRAFRVLiYQNGKFQAI SEKWFGDDVATANIKS 

SEQ ID NO: 109 polynucleotide sequence encoding GAS 84 

ATGATTATAAAAAAAAGAACCGTAGCAATTTTAGCCATAGCTAGT 

TACTAAAAGTCTTAAATCAGGAGATGCTTGGGGAGTTTACCAAAAGCAAAAAAGTATTACAGTTGGTTT^ 
ACAATACGTTTGTTCCTATGGGCTATAAGGATGAAAGCGGCAGATGCAAAGGTTTTGATATTGATTTGGCT 
AAAGAAGTTTTTCACCAATATGGACTCAAGGTTAACTTTCAAGCTATTAATTGGGACATGAAAGAAGCAGA 
ACTAAACAATGGTAAAATTGATGTAATCTGGAATGGTTATTCAATAACTAAGGAGCGTCAGGATAAGGTTG 
CCTTTACTGATTCTTACATGAGAAATGAACAAATTATTGTTGTCAAAAAAAGATCTGATATTAAAACAATA 
TCAGATATGAAACATAAAGTGTTAGGAGCACAATCAGCTTCATCAGGTTATGACTCCTTGTTAAGAACTCC 
TAAACTGCTGAAAGATTTTATTAAAAATAAAGACGCTAATCAATATGAAACCTTTACACAAGCTTTTATTG 
ATTTAAAATCAGATCGTATCGATGGAATATTGATTGACAAAGTATATGCCAATTACTATTTAGCAAAAGAA 

GGGCAATTAGAGAATTATCGGATGATCCCAACGACCTTTG 

AGAAGACAAAACGTTGCAAGCAAAAATTAATCGTGCTTTCAGGGTGCTTTATCA^ 
CTATTTCTGAGAAATGGTTTGGAGATGATGTTGCCACTGCCAATATTAAATCTTAA 

SEQ ID NO: 110 amino acid sequence comprising N-terminal leader sequence of GAS 84 

MI I KKRTVAILAI AS SFFLVA 

SEQ ID NO: 111 amino acid sequence comprising a fragment of GAS 84 where the N-terminal 
leader sequence is removed 

C Q ATKSLKSGDAWGVY QKQKS ITVGFD1TTFVPMGYKDE SGRCKGFDIDIxAKEVFHQYGIiKVNFQAXNWDMK 
EAELNNGKIDVIWNGYS ITKERQDKVAFTDS YMRNEQI I WKKRSDIKT I SDMKHKVLGAQS AS SGYDSLL 
RTPKLIjKDF IKNKDANQYETFTQAF IDIiKSDRIDGILIDKVYANYYIiAKEGQLENYRMI PTTFENEAF SVG 
LRKEDKTIjQAKINRAFRVLYQNGKFQAI SEKWFGDDVATANIKS 

SEQ ID NO: 112 amino acid sequence comprising GAS 384 

MKTIJ^DTSNKTLSLAILDDETIil^ 
LRVAVATAKTLAYSLNI ALVGI S SLYAI 

EVIIEQIiVEEGQLIFVGETAPFAEKIQKKLPQAILLPTIiPSAYECGIiLGQSLAPENVDA^ 
ENWLKDNEIKDDSHYVKRI 

SEQ ID NO: 113 polynucleotide sequence encoding GAS 384 

ATGAAGACACTTGCATTTGATACCTCAAATAAAACCTTGTCCCTTGCTATACTTGATGATGAGACACTTCT 
AGCAGATATGACCCTTAACATTCAGAAAAAACATAGTGTTAGCCOT 

CTTGTACTGATCTTAAACCTCAAGATTTAGAAAGAATAGTGGTTGCAAAAGGCCCTGGATCTTACACAGGT 

TTACGAGTGGCAGTTGCTACTGCAAAAACGTTAGCGTACAGTTTAAATATTGCATTGGT^ 

TCTATATGCTTTGGCTGCGTCTACTTGTAAACAGTATCCAAATACTTTGGTGGTGCCATT 

GAAGGCAAAATGCGTATGTAGGTTATTATCGGCAAGGAAAATCAGTGATGCCACAAGCCCATGCTTCACTA 

GAAGTTATTATAGAACAATTAGTAGAAGAAGGACAGCTGATTTTTGTTGGGGAGACTGCTCCTTTTGCTGA 
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GAAAATTCAAAAGAAACTACCTCAGGC 

TGGGGCAAAGTTTGGCACCAGAAAATGTAGACGCC^ 

GAAAACTGGCTCAAAGATAATGAGATAAAAGATGATAGTCACT 

SEQ ID NO: 114 amino acid sequence comprising GAS 202 

MLKMjV^ILGPLLIAFVLVVM 

SSEWSRMDSMHPSVLAERYKRSYRPFLIGKRGSA 

AVQMYLSNTQVIEFLIjKARTDKESQFAAKRIjIjEI^^ 

ESLFSFLGKSTNYEKRILPRVKGLPKVFSYKQIiNAI^^ 

QWTSYLASPEYNDFQIAjIjSEFAKRCT 

DFSKDGGESYFMQOTIHIiGWNGWIAFDKKVQPFLET^ 

SEQ ID NO: 115 polynucleotide sequence encoding GAS 202 

ATGCTTAAGAGACTCTGGTTAATTCTAGGTCCTCTTCTTA 

TAGTTTTCCTACACAACTTGATCATTCCATAGCTCAGGAAAAAGCAAATGCCGTTGCGATC 
CTTTTAAAAATGGTTTGATTAAAAGACAAGCTTT^^ 

TCTAGCGAATGGAGTCGAATGGATAGTATGCACCCTTCGGTGCTTGCAGAGCGCTACAAGCGGAGCTATAG 
ACCATTTTTAATTGGTAAGAGAGGATCAGCATCTTT 

AAATGCAAAAGAAAAAAGCCATCTTTGTAGTATCTCCTCAATGGTTTACTGCTCAAGGGATTAATCCTAGT 

GCGGTTCAGATGTACTTGTCTAACACTCAAGTGATTGAATTTTTACTAAAAGCTAGAACTGATAAAGAATC 
AGAGTTTGCAGCAAAGCGTTTGCTTGAGCTTAACC^ 

GTAAGGGTAAGTCTCTTAGTCGGTTAGACAGAGCTATTTTGAAATGTCAACATCAAGTAGCATTGAGAGAA 
GAGTCCCTTTTOAGTTTTTTAGG^ 

ACCTAAAGTAOTTTCGTATAAACAATTGAATGCATTAGCAACTAAGAGAGGCCAATTAGCAACAACCAACA 
ACCGTTTTGGGATTAAAAATACATTT^^ 

CAAGTTAATTATAGTTACCTGGCGTCACCAGAATACAATGATTTTCAGCTTTTATTATCAGAATTTGCTAA 
ACGAAAAACAGATGTACTCTTTGTTATAACTCCTGTTAATAAAGCTTGGGCGGATTATACCGGCTTAAATC 
AAGATAAGTATCAAGCGGCAGTTCGTAAAATAAAATTCCAGTTAAAGTCACAAGGATTTCATCGCATTGCT 
GACTTCTCAAAAGATGGTGGTGAGTCCTACTTTATGCAAGATACCATCCATCTCGGTTGGAATGGCTGGTT 
AGCTTTTGATAAGAAAGTGCAACCATTTCTAGAAACGAAGCAGCCAGTGCCCAACTATAAAATGAACCCTT 
ATTTTTATAGTAAAATTTGGGCAAATAGGAAAGACTTGCAATAG 

SEQ ID NO: 116 amino acid sequence comprising GAS 057 

MEKKQRFSLRKYKSGTFSVLIGSVFLVMTT^ 

SQITLKTNREKEQSQDLVSEPTTTELADTDAASMANTGSDATQKSASL 

QGKWAVIDTGIDPAHQSMRI SDVSTAKVKS KEDMLARQKAAG IN¥X3SWINDKWFAHNYVENSDNIKEN"Q 
FEDFDEDWEOTEFDAEAEPKAIKKHKIYRPQSTQAPKEW 

GI VAGNSKEAAATGERFLGI APEAQVMFMRVFANDIMGS AE S LF I KAI EDAVALGADVTNIjSLGTANGAQL 

SGSKPLMEAIEKAKKAGVSVWAAGNERVYGSDHD^ 

MTVKELENRADLNHGKAIYSESVDFKD^ 

DEMIALAKKHGAIjGVLIFNNKPGQSNRSMRLTANGMGI PSAFI SHEFGKAMSQIiNGNGTGSLEFDS WSKA 
PSQKGNEMNHFSNWGLTSDGYDKP 

NIi PKEKIAD I VKNIiLMSNAQI HVNPETKTTTS PRQQGAGIiLNIDGAVTSGLYVTGKDNYGS I SLGNITDTM 
TFDWVHNLSNKDKTLRYDTELM 

LTKQMPNGYYLEGFVRFRDSQDDQLITOVNIPFVGFKGQFENIiAVAEESIYRIjKSQGKTGFY 
YVGKHFTGIiVTLGS ETNVSTKTI SDNGLHTLGTFKNADGKF ILEKNAQGNPVLAI S PNGDNNQDFAAFKGV 
FLRKYQGLKASVYHASDKEHKNPLWSPESFKGDKNFNSDIRFAKSTTLLGTAFSGKSLTGAE 
WSYYPDWGAKRQEMTFDMILDRQKPVLSQATFDPETNRFKPEPL^ 

VTINDS YKYVS VEDNKTFVERQADG S F IL PLDKAKLGDFYYMVEDFAGNVAI AKLGDHLPQTLiGKTP IKLK 

LTDGNYQTKETLKDNIiEMTQSDTGLVTNQAQIjAVV^^ 

NVYNDLTVNVYAKDDHQKQT^ 

YTISVNDKKPMITQGRFDTINGVDHFTPDKTKALDSSGIVREEWYLAKKN 
YI PKNPDGS YTI SKRDGVTLSDYYYLVEDRAGWSFATIj^ 

LVRDADGKPIENLEYYl^SGNSLILPYGKYTVEIjLTYDTNAAKLESDKIVSFTIiSAD 
TSQITAHFDHLLPEGSRVSLKTAQDQLIPLEQSLYVPKAYGKTVQEGTYEVVVSLPKGYRIEGNTKVNTIj 
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• NEVHELSLRLVKVGDASDSTGDHKVM^ 
GLTCVF S RKKS TKD ■ " 

SEQ ID NO: 117 polynucleotide sequence encoding GAS 057 

GTGGAGAAAAAGC AACGTTTTTC CC TOAGAAAATAC AAATCAGGAACGTTTTC GGTCTTAATAGGAAGCGT 
TTTCTTGGTGATGACAAC AACAGTAGCA GC AGATGAGCTAAGC^CA ATQA(y , r;& a r*r» a a r & a n%n & rv^ a 

ACGCTCAACAACAAGCGCAACATCTCACCAATACAGAGTTGAGCTCAGCTGAATCAAAATCTCAAGACACA 
TCACAAATC AC TCTC AAGACAAATC GTGAAAAAGAGC AATC AC AAGATCTAGTCTCTGAGC C AACCAC AAC 

TGAGCTAGCTGACACAGATGCAGCATCAATGGCTAATACAGGTTCTGATGCGACTCAAAAAAGCGCTTCTT 
TACCGCGAGTCAATACAGATGTTCACGATTGGGTAAAAACCAAAGGA 

CAAGGCAAGGTTGTCGCAGTTATTGACACAGGGATCGATCCGGCCCATCAAAGCATGCGCATCAGTGATGT 

ATCAACTGCTAAAGTAAAATCAAAAGAAGACATGCTAGCACGCCAAAAAGCCGCCGGTATTAATTATGGGA 
GTTGGATAAATGATAAAGTTGTTTTTGGkCA 

TTCGAGGATTTTGATGAGGACTGGGAAAACTTTGAGTTTGATGCAGAGGCAGAGCCAAAAGCCATCAAAAA 

ACACAAGATCTATCGTCCCCAATC^CCCAGGCACCGAAAGAAACTGl^ATCAAAACAGAAGAAACAGATG 

GTTCACATGATATTGACTGGACACAAACAGACGATGACACCAAATACGAGTCACACGGTATGCATGTGACA 

GGTATTGTAGCCGGTAATAGCAAAGAAGCCGCTGCTACTGGAGAACGCTTTTTAGGAATTGCACCAGAGGC 

CCAAGTCATGTTGATGCGTGTTTTTGCCAACGACATCATGGGATCAGCTGAATCACTCTTTATCAAAGCTA 

TCGAAGATGCCGTGGCTTTAGGAGCAGATGTGATCAACCTGAGTCTTGGAACCGCTAATGGGGCACAGCTT 

AGTGGCAGCAAGCCTCTAATGGAAGCAATTGAAAAAGCTAAAAAAGCCGGTGTATCAGTTGTTGTAGCAGC 

AGGAAATGAGCGCGTCTATGGATCTGACCATGATGATCCATTGGCGACAAATCCAGACTATGGTTTGGTCG 
GTTCTCCCTCAACAGGTCGAACACCAACATGM3TGGCAGCTA 

ATGACGGTCAAAGAAOTAGAAAACCGTGCCGATTTAAACCATGGTAAAGCCATCTATTCAGAGTCTGTCGA 

CTTTAAAGACATAAAAGATAGCCTAGGTTATGATAAATCGCATCAATTTGCTTATGTCAAAGAGTCAACTG 

ATGCGGGTTATAACGCACAAGACGTTAAAGGTAAAATTGCTTTAATTGAACGTGATCCCAATAAAACCTAT 

GACGAAATGATTGCTTTGGCTAAGAAACATGGAGCTCTGGGAGTACTTATTTTTAATAACAAGCCTGGTCA 

ATCAAACCGCTCAATGCGTCTAACAGCTAATGGGATGGGGATACCATCTGCTTTCATATCGCACGAATTTG 

GTAAGGCCATGTCCCAATTAAATGGCAATGGTACAGGAAGTTTAGAGTTTGACAGTGTGGTCTCAAAAGCA 
CCGAGTCAAAAAGGCAATGAAATGAATCATTTT^^ 

TGACATTACTGCACCAGGTGGCGATATCTATTCTACCTATAACGATAACCACTATGGTAGCCAAACAGGAA 

CAAGTATGGCCTCTCCTCAGATTGCTGGCGCCAGCCTTTTGGTCAAACAATACCTAGAAAAGACTCAGCCA 

AACTTGCCAAAAGAAAAAATTGCTGATATCGTTAAGAACCTATTGATGAGCAATGCTCAAATTCATGTTAA 

TCCAGAGACAAAAACGACCACCTCACCGCGTCAGCAAGGGGCAGGATTACTTAATATTGACGGAGCTGTCA 

CTAGCGGCCTTTATGTGACAGGAAAAGACAACTATGGCAGTATATCATTAGGCAACATCACAGATACGATG 

ACGTTTGATGTGACTGTTCACAACCTAAGCAATAAAGACA2\AACATTACGTTATGACACAGAATTGCTAAC 

AGATCATGTAGACCCACAAAAGGGCCGCTTCACTTTGACTTCTCACTCCTTAAAAACGTACCAAGGAGGA 

AAGTTACAGTCCC^GCCAATGGAAAAGTGACTGTAAGGGTTACCATGGATGTCTCACAGTTCACAAAAGAG 

CTAACAAAACAGATGCCAAATGGTTACTATCTAGAAGGTTTTGTCCGCTTTAGAGATAGTCAAGATGACCA 

ACTAAATAGAGTAAACATTCCTTTTGTTGGTTTTAAAGGGCAATTTGAAAACTTAGCAGTTGCAGAAGAGT 

CCATTTACAGATTAAAATCTCAAGGCAAAACTGGTTTTTACTTTGATGAATCAGGTCCAAAAGACG 

TATGTCGGTAAAC ACTTTAC AGGAC TTGTC ACTC TTGGTTC AGAGACCAATGTGTCAACCAAAACGATTTC 

TGACAATGGTCTACACACACTTGGCACCTTTAAAAATGCAGATGGCAAATTTATCTTAGAAAAAAATGCCC 

AAGGAAACCCTGTCTTAGCCATTTCTCCAAATGGTGACAACAACCAAGATTTTGCAGCCTTCAAAGGTGTT 

TTCTTGAGAAAATATCAAGGCTTAAAAGCAAGTGTCTACCATGCTAGTGACAAGGAACACAAAAATCCACT 
GTGGGT<^GCCCAGAAAGCTTTAAAGGAGATAAAAACTTT^ 

CCC TGTT AGGC ACAGC ATTTTCTGGAAAATCGTTAAC AGGAGCTGAATTAC C AGATGGGC ATTATC ATTAT 
GTGGTGTCTTATTACCCAGATGTGGTCGGTGCCAAACGTCAAGAAATGACATTTGACATGATTTTAGACCG 
ACAAAAACCGGTACTATCACAAGCAACATTTGATCCTGAAACAAACCGATTCAAACCAGAACCCCTAAAAG 
ACCGTGGATTAGCTGGTGTTCGCAAAGACAGTGTCTTTTATCTAGAAAGAAAAGACAACAAGCCTTATACA 
GTTACGATAAACGATAGCTACAAATATGTCTCAGTAGAAGACAATAAAACATTTGTGGAGCGACAAGCTGA 
TGGCAGCTTTATCTTGCCGCTTGATAAAGCAAAATTAGGGGATTTCTATTACATGGTCGAGGATT^ 

GGAACGTGGCCATCGCTAAGTTAGGAGATCACTTACCACAAACATTAGGTAAAACACCAATTAAACTTAAG 
CTTACAGACGGTAATTATCAGACCAAAGAAACGCTTAAAGATAATCTTGAAATGACACAGTCTGACACAGG 
TCTAGTCACAAATCAAGCCCAGCTAGCAGTGGTGCACCGCAATCAGCCGCAAAGCCAGCTAACAAAGATGA 
ATCAGGATTTCTTTATCTCACCAAACGAAGATGGGAATAAAGACTTTGTGGCCTTTAAAGGCTTGAAAAAT 
AACGTGTATAATGACTTAACGGTTAACGTATACGCTAAAGATGACCACCAAAAACAAACCCCTATCTGGTC 
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TAGTCAAGCAGGCGCTAGTGTATCCGCTATTGAAAGTACAGCCTGGTATGGCATAAGA 
AGGTGATGCCAGGTGATTATCAGTATGTTGTGACCTATCGTC 

TACACCATATCTGTGAATGAGAAAAAACCAATGATCACTCAGOT 

CCACTTTACTCCTGACAAGACAAAAGCCCTTGACTCATCAGGCATTGTCCGCGA^ 

CCAAGAAAAATGGCCGTAAATTTGATGTGACAGAAGGTAAAGATGGTATCACAGTTAGTGA 

TATATC CC TAAT^AATCC AGATGGTTCTTACACCATTTC AAAAAGAGATGGTGTC AC ACTGTCA.GATT ATT A 

CTACCTTGTCGAAGATAGAGCTGGTAATGTGTCTTOTGCTA 

AC AAAGC AGTAGTCAATTTTGGATT AGACTTACCGGTC C CTGAAGAC AAAC AAAT AGTGAACTTT ACCT AC 

C TTGTGCGGGATGC AGATGGTAAACCGATTGAAAAC CTAGAGTATTATAATAACTC AGGTAAC AGTCTTAT 

CTTGCCATACGGCAAATACACGGTCGAATTGTTGACCTATGACACCAATGCAGCCAAACTAG^ 

AAATCGTTTCCTTTACCTTGTCAGCTGATAACAACTTCCAACAAGTTACCTTTAAGATAACGATGTTAGCA 

ACTTCTCAAATAACTGCCCACTTTGATCATCTTTTGCCAGAAGGCAGTCGCGTTAGCCTTA2VAACAGCTCA 

AGATCAGCTAATCCCGCTTGAACAGTCCTTGTATGTGCCTAAAGCTTATGGCAAAACCGTTCAAGAAGGCA 

CTTACGAAGTTGTTGTCAGCCTGCCTAAAGGCTACCGTATCGAAGGCAACACAAAGGTGAATACCCTACCA 

AATGAAGTGGACGAACTATCATTACGCCTTGTCAAAGTAGGAGATGCCTCAGATTCAACTGGTGATCATA^ 

GGTTATGTCAAAAAATAATTCACAGGCTTTGACAGCCTCTGCCACACCAACCAAGTCAACGACCTCAGCAA 

CAGCAAAAGCC CTACCATCAACGGGTGAAAAAATGGGTCTCAAGTTGCGCATAGTAGGTCTTGTGTTACTC 

GGACTTACTTGCGTCTTTAGCCGAAAAAAATCAACCAAAGATTGA 

. SEQ ID NO: 118 amino acid sequence comprising N-terminal leader sequence of GAS 57 

MEKKQRFSLRKYKSGTFSVIjIGSVFIjVMTTTVA 

* 

SEQ ID NO: 119 amino acid sequence comprising a fragment of GAS 57 where the N-terminal 
leader sequence is removed 

ADELSTMSEPTITNHAQQQAQHLTOTELSSAE^ 
MAOTGSDATQKS ASLP PVNTDVHDWVK^^ 
DMLARQKAAGINYGSWI1^K\M^AHNYVT^ 

QAPKETVI KTEETDGSHD IDWTQTDDDTKYESHGMHVTGI VAGNSKEAAATGERFLGI APEAQVMF^^^VFA 
NDIMGS AE SLF I KAI EDAVALGADVINL SLGTANGAQLSGSKPDMEAIEKAKKAWSVWAAGl^TERVYGSD 
HDDPLATNPDYGLVGSPSTGRTPTSVAAINSKW 
YDKSHQFAYVKESTDAGYNAQDVKGKIALIERDPNKTYD 

NGMG I P S AF I SHEFGKAMS QLNGNGTGSLiEFDSWS KAPSQKGNEMNHFSNWGIiTSDGYIiKPDH TAPGGDI 

YSTYNDimYGSQTGTSl^SPQIAGASIiliVKQYLEKTQPNIiPKl^ 

RQQGAGLLNIDGAVTSGLiYVTGKDNYGSISIjGNITO 

FTLTSHSIjKTYQGGEVTVPANGKVTVRVTMDVS QFTKEIiTKQMPNGYYLEGFVRFRD sqddqlnrvnx pfv 
gfkgqfenlavaee s I yrlksqgktgfyfdesgpkddi YVGKHFTGLVTLGS ETNVSTKTI SDNGLHTLGT 

FKNADGKF ILEKNAQGNPVIiAI S P^GDNNQDFAAFKGVFLRKYQGLKASVYHASDKEHKNPLWVS PE SFKG 
DKNFNSDIRFAKSTTLLGTAFSGKSIiTGAELPDGHYHYWSYYPDWGAKRQEMTFDM 
FDPETmFKPEPLKDRGLAGVRKDSVFYLERKDNKPYTVTIlSro 
AKLGDFYYMVEDFAGWAIAKLGDHLPQTLGKTPIK^ 

WHRNQPQS QLTKMNQDFF I S PNEIX3NKDFVAFKGIjKNN\nflTOLT^ S QAGASVS A 

I E STAWYGI TARGSKVMPGDYQYWTYRDEHGKEHQKQYTI SVNDKKPMI TQGRFDT INGVDHFTPDKTKA 

LDS SGI VREEVF YLAKKNGRKFDVTEGKDGITVSDNKVYI PKNPDGS YTISKRDGVTLSDYYYLVEDRAGN 

VSFATLRDLKAVGKDKAVVNFGLDLPVPEDKQIVNFTYLVRDADGKPIEIS^ 

LLTYDTNAAKIjESDKIVSFTLSADNNFQQVTFKIT^ 

IiWPKAYGKTVQEGTYEVVVSLPKGYRIEGNTKVNTLPNEVHELSLRL 

LTASATPTKSTTSATAKALPSTGEKMGLKLRIVGLVLLGIjTCVFSRKKSTKD 

SEQ ID NO: 120 amino acid sequence comprising C-terminal hydrophobic regiop 
LP STGEKMGLKLRI vglvllgltc vfsrkkstkd 

SEQ ID NO: 121 amino acid sequence comprising a fragment of GAS 57 where the C-terminal 
hydrophobic region is removed 

MEKKQRFSLRKYKSGTFSVIjIGSWLVMT^^ 

sqitlktnrekeqsqdlvsepttteladtdaa 
qgkwavidtgidpahqsmrisdvstakvkskedm^ 
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FEDFDEDWENFEFDAEAEPKAIKKH^ 

G IVAGNSKEAAATGERFLG IAPEAQVMFMRVFANDIMGS AE SLF IKAI EDAVALGADVINLiSLGTANGAQLi 
SGSKPLMEAIEKAKKAGVSVVVAAGNERVYGSDHDDPIiATNPDyGL 

• MTVKELENRADIiNHGKAI YS E SVDFKDI KD SLGYDKSHQF AYVKESTDAGYHAQDV1CGKI AIjI ERDPNKTY 
DEMIALAKKHGALGVIiIFNNKPGQSNRSMRIjTANGMGI PSAFI SHEFGKAMSQLNGNG-TGSLEFDSWSKA 
P SQKGNEMNHF SNWGLTSDGYIjKPDITAPGGDIYSTYNDNHYGSQTGTSMAS POXAGA SLIjVKQYIjEKTQP 
NLP^KIADrVKNIiLMSNAQIHVNPETKTTTS PRQQGAGXjLNIDGAWSGLYVTGKDNTTGS I SLGNTTDTM 
TFDVTVHNLSNKDKTIjRYDTELIiTDHVDPQKG 

LTKQMPNGYYLEGFVRFRDSQDDQLNRVNI PFVGFKGQFENLAVAEES I YRLKSQGKTGFYFDESGPKDDI 
YVGKHFTGLVTLrGSETNVSTKT I SDNGIiHTIjGTFKNADGKF ILEKNAQGNPVLAIS PNGDNNQDF AAFKGV 
FLRKYQGLKASVYHASDKEHKNPLWSPESFKGDKNFNSDIRFAKSTTLLGTAFSGKSL 
WSYYPDWGAKRQE^FDMILDRQKPVLSQATFDPETNRFKPEPLKDRGLAGVRK^ 

VTINDSYKWSVEDNKTFVERQADGSFILPIjDKAKLGDFYYMVEDFAGNVAIAKLGDHI^ 
LTDGNYQTKETLKDNLEMTQSDTGLVTN 

NVYNDLTVNVYAKDDHQKQTP^ 

YTISVNDKKPMITQGRFDTINGVDHFTPDKTKALDSSGIVREEVFYLAKK^ 
YIPKNPDGSYTISKKDGVTLSDYYYIjVEDRAGNVSFAT^ 

LVRDADGKP IENLEYYNNS GNSLI Ii PYGKYTVELLTYDTNAAKLESDKI VSFTLS ADNNFQQVTFKI TMLA 

• TSQ I TAHFDHLLPEGS RVSLiKTAQDQL I PLEQ SLYVPKAYGKTVQEGTYEVVVSLPKG YRIEGNTKVNTLP 
NEVHEIjSIjRIiVKVGDASDSTGDHKVMSKNNSQALTASATPTKSTTSATAKA 

SEQ ID NO: 122 amino acid sequence comprising a fragment of GAS 57 where both the N- 
terminal leader sequence and the C-terminal hydrophobic region are removed 

ADELSTMSEPTITNHAQQQAQHLTOTEL^ ^ 

MANTGSDATQKSASL PPYNTDVHDWVKTKGAWDKGYKGQGKWAVIIXrGIDPAH SDVSTAKVKSKE 
DMLARQKAAGINYGSWINDKVVFAHNYVENSDNIKENQ 

QAPKETVI KTEETDGSHDI DWTQTDDDTKYESHGMHVTGIVAGN SKEAAATGERFLGI APRAOVMrnvrR^^T? a 



NDIMGSAESLFIKAIEDAVALGADVIl^SLGT^ 
HDDPIiATNPDYGLVGSP STGRTPTSVAAINSKWVI QRLMTVKELtENRADLNHGKAI YSE SVDFKDIKDSLG 
YDKSHQFAYVKESTDAGYNAQDVKGKI AL I ERDPNKTYDEMI ALAKKHGALGVL IFNl^iCPGQ SIvTRSMRLTA 
NGMGIPS AF I SHEFGKAMSQIiNGNGTGSIiEFDSWSKAP SQKGNEMNHF SNWGLTSDGYTjKPDITAPGGDI 
YSTYM5NHYGSQTGTSMASPQI AGASLLVKQYLEKTQPNLPKEKIADIVKNLLMSNAQ3I HVNPETKTTTS P 
RQQGAGLLNIDGAVTSGLYVTGKDNYGSISLGNITDTMTFDVTVHNLSNKDKTLRYDTE 
FTLTSHSLKTYQGGEVTVP ANGKVTVRVTMDVSQFTKELTKQMPNGYYLEGFVRFRDSQDDQLITRVNI PFV 
GFKGQFENLAVAEE S I YRLKS QGKTGFYFDESGPKDDI YVGKHFTGIiVTLGS ETNVSTICTI SDNGLHTLGT 
FKNADGKF ILEKNAQGNPVL AI S PNGDNNQDFAAFKGVFIjRKYQGLKASVYHAS DKEHICNPIjWVS PESFKG 
DKNFNSDIRFAKSTTUjGTAF SGKSLTGAEL PDGHYHYWS YYPDWGAKRQEMTFDMI XiDRQKPVIiSQAT 
FDPETNRFKPEPLKDRGLAGVRKDSVFYLERKDNKPYTVTINDSYKWSVEDNK^ 

AKLGDF YYMVEDFAGNVAIAKLGDHLPQTLGKTP I KLKLTDGNYQTKETLKDNLEMTQS DTGIiVTNQAQLA 
WHRNQPQSQLTKMNQDFF I S PNEDGNKDFVAFKGLKNNVYNDIjTVNVYAKDDHQKQTP IWS SQAGAS VS A 
lESTAWGITARGSKVMPGDYQYVVTYRDEHGKEHQKQYTISVNDKKPMIT^ 

LDSSGI VREEVFYIiAKKNGRKFDVTEGKDGITVSDNKVYI PKNPDGSYTI SKRDGVTLS DYYYLVEDRAGN 
VSFATLRDLKAVGKDKAVVNFGLDLPVPEDKQIVNFTYLVRDADGKPI ENLEYYNNSGITS L I LP YGKYTVE 
LLTYDTNAAKLESDKIVSFTLSADNNFQQWFKITMIiATSQITAHFDHIiLPEGSRVSIiKTAQDQLIPIiEQS 
LYVPKAYGKTVQEGTYEVW SLPKGYRI EGNTKVNTI*PNEVHEIiSLRLVKVGDASDSTGI?HKVMSKNNS OA 
LTASATPTKSTTSATAKA 

SEQ ID NO: 123 amino acid sequence of a GAS M protein 

MAKNNTNRHYSLRKLKTGTAS VAVALTVLGAGFA RLRYENK 

DLKARLENAMEVAGRDFKRAEELEKAKQALEDQRKDLETKLKELQQDYDI^ 

ALELAIDQASRDYHRATALEKELEEKKKALELAIDQASQDYNRAWljEK^ 

JjDQL S SEKEQLTI EKAKLEEEKQI SDAS RQSLRRDIjDASREAKKQVEKDLANbTAEljDKVICEDKQI SDASR 

QGLRRDLDASREAKKQVEKDLANLTAELDKVKEEKQISDASRQGLRRDLDASREAKKQVEK^ 

ALEKLNKELEESKKLTEKEKAELQAKLEAEAKALKEQIJ^QAEELAKL 

GQAPQAGTKPNQNKAPMKETKRQLPSTGETANPFFTAAALTVMATAGVAAVVKRKEEN 
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SEQ ID NO: 124 amino acid sequence of GAS Sfbl 

MSEDGFFLHHLTNELKENLLYGRIQKVNQPFERELV^ 

POTFTMIMRKYLQGAVIEQLEQIDNDRIIEIKVSNKNEIGDAIQATIjI IEIMGKHSNI ILVDRAENKIIES 
IKHVGFSQNSYRTILPGSTYIEPPKTAAVOTFTITDVPLFEILQTQELTVKSLQQH 
LTTDKLKRFREFFARPTQANLTTASFAPVLFSDSHATFETLSDMI^HFY 
ELDKNRl^SKQEAELLATENAELFRQKGELLTTYliSLVP 

QRYFKKYQKIiKEAVKHLSGIiIADTKQS ITYFESVDYNLSQASIDDIED IREEL YQAGFLKSRQRDKRHKRK 
KP EQYIjASDGTTILMVGRNNIjQNEEIiTFKMAKKGELWFHAKDI PGSHVI I KDNLDP SDEVKTDAAEIiAAYY 
SKARLSNLVQVDMI EAKKLHKP SGAKPGFVTYTGQKTLRVTPDQAKIL SMKIiS 

SEQ ID NO: 125 amino acid sequence of a GAS Shp protein 

MTKWIKQLIjQVIWFMI SL STMTNLVYADKGQ I YGC I IQRNYRHP I SGQ I EDSGGEHS FD IGQGMVEGTV 

YSDAMLEVSDAGKrVLTFRMSLADYSGNYQFWIQPGGTGSFQAVDYNITQKGTDTNGTTLDI^ 

I IRGSMFVEPMGREVVF YIjS AS ELI QKYSGNMLAQLVTETDNS QNQEVKD SQKPVDTKLGESQDESHTGAM 

ITQl^PKANSSN^SLSDKKILPSKMGLTTSLELKKEDKFRSKKDLSIMIYW 
KKNDKTM 

SEQ ID NO: 126 amino acids 10 to 30 of GAS protein SagA 

FS IATGSGNSQGGSGS YTPGKC 

SEQ ID NO: 127 polynucleotide sequence comprising fusion construct 117-40a-RR 

ATGGCCTTTAACACAAGCCAGAGTGTCAGTGCACAAGTTTATAGCAATGAAGGGTATCACCAGCATTTGAC 
TGATGAAAAATCACACCTGC7^TATAGTAAAGACAACGCACAACTTCAATTGAGAAATATCCTTGACGGCT * 
ACCAAAATGACCTAGGGAGACACTACTCTAGCTATTATTACTACAACCTAAGAACCGTTATGGGACTATCA 



CGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATT 

GATGCAGTTGAAAAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAAC 

TACTGCTGAAATCAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAA 

TTTACACTAATACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTA 

ACAGCTACTGAAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACA 

AAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAA.CAAGTCAAAACGTCTGAACAAA 

ATATTGCTAAGCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGAT 

AATACAAAAGCATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAA 

AAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGA2\AAAGAGGCAGAACTTAGTCGTC 

TTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTAT 
CCTCTTGAAGAACTTAAAAAATTAGAAGCTAGTGGTTAT 

AGAGCATGCAGATCAAATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATCAATACCAAGATATTCCAG 

CAGATCGTAATCGCTTTGTTGATCCCGATAATTTGACACCAGAAGTGCAAAATGAGCTAGCGCAGTTTGCA 

GCTCACATGATTAATAGTGTAcGtcGtCAATTAGGTCTACCACCAGTTA.CTGTTACAGCAGGATCACAAGA 

ATTTGCAAGATTACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGAC 
AGCCAGGGGTATCAGGGCATTATGGTGTTGGGCCT^ 

TCAGGGCTCATTCGAAATGATGATAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAA 

TGGTATTAAACGTGGTATTTATGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACAT 

ACGGCCATGCTATTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTTACCTTGGATTTTCA 
ACCAGCAATGTAGGATCTTTGAATGAACACTT^^ 

CTTTAATAAGACCCCTATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGCACTGTATCTG 
ATACTATTGCAGCGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCT 
GATATTATGGCAGCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGA 
CAGCTTAAATCTCCAAGTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTA 
AAGCAAAACAAGCACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAA^ 

CTGCACCAGACAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCA 
TTTGCAATATCTAAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATA 
CTAAGCAAGATTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCT 
AAACAAAGCAGTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAA 
CGAAAAGGAATATCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTA 



34/38 



WO 2005/032582 PCT/US2004/024868 

SEQUENCE LISTING 

CGGGCGTAAAAC CGCTATCATATAGTAAGATAGATAC TAC TCCGCTTGTTC AAGAAATGGTTAAAGAAACG 
AAACAACTATCAGAAGCTTCAGCAAGA 

CCAAACCTCTGAAATGGTAGCAAGTAATGCCATTGTGTC taaaatc acatc ttcgattactc agc cctcat 

CTAAGACATCTTATGGCTC AGGATCTTC TAC AAC GAGC AATCTCATTTCTGATGTTGATGAAAGTACTCAA 
cGt gcggccgc ac t cgagCACCACCACC ACCACCAC 

SEQ ID NO: 128 amino acid sequence comprising fusion construct 117-40a-RR 
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SEQ ID NO: 129 amino acid sequence comprising a linker in the 1 17-40a-RR construct 
YASGGGS 



SEQ ID NO: 130 polynucleotide sequence comprising 40a-RR-117 fusion construct 

ATGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAACGAAGGCGAGTAATAC 
TCACGACGATAGTTTACCAAAACCAGAAACA 

CTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAACTACTGC 
CACTTAAAAGAGCAGCAAGATAATGAACAAAA 

TGCAAGTAGTGAGGAGACGCTATOAGCCCAAGGAGCCGAACATCAAAGAGAGTTAACAGCTACTGAAACAG 

AGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACAAAAAGCTAGCATTTCA 

GCAGAAACTACTCGAGCTCAAGATTTAGTGGAAGAAGTCiVAAACGTCTGAACA^ 

TGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGATAATACAAA^ 

GCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAJVA 

GAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTGGTCTTAAATCCTCA 

GTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAA.GCACCGCAAGGCTATCCTCTTGAAGAACTTA 

AAAAATTAGAAGCTAGTGGTOATATTGGATCAGCTAGTTACAATAATOATTACAA^ 

ATTATTGCCAAAGCTAGTCCAGGTAATCAATTAAATC 

TGTTGATCCCGATAATTTGACACCAGAAGTGCAA 

GTGTAcGtcGtCAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGAATTTO 
AGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGACAGCCAGGGGTATCAGG 
GCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAA.GACTCTGCCGGAGCGTCA 
ATGAxX^TAAGATGTACGAGAATATCGGTGCTTTTAACGAwTGTGCATACTGTGAATGGTATTAAACGTC 
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ATTTATGACAGTATCAAGTATATGCTC 

CTTTTTACGTGTAGATAAACATAACCC TAATGC GCCTGTTTACCTTGGATTTTC AACCAGCAATGTAGGAT 
CTTTGAATGAACACTTTGTAATGTTTCCAG^ 

ATAAAAGCCGTTGGAAGTACAAAAGATTATGCCCAAAGAGTAGGC^ 
CAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCT 
AAGCTAAAGTAAGTCAACTTCAAGGTAAATTAG^ 
GTGAGACAATTAAATGATACTAAAGGTTCTTT^^ 

ACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCACTGCACCAGACAGAAG 

CCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCATTTGCAATATCTAAGG 
GACTTTAAATTGAATCCTAACCGCCTTCAAGTGA^ 

TAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCTAAACAA 

AAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAACGAAAAGGAATATCGC 

CACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTACGGGCGTAAAACCGCT 

ATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACGAAACAACTATTAGAAG 
CTTCAGCAAGATTAGCTGCTGAAAATACAAGTCOT 

GTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCATCT AAGACATCTTATGG 
CTCAGGATCTTCTACj^^ 

^I^ATGGCCTTTAACACAAGCCAGAGTGTCAGTGCACAAGTTTATAGCAATGAAGGGTATCACCAG^ 

TTGACTGATGAAAAATCACACCTGCAATATAGTAAAGAGAACGCACAACTTCAATTGAGAAATATCCTTGA 

CGGCTACCAAAATGACCTAGGGAGACACTACTCTAGCTATTATTACTACAACCTAAGAACCGTTATGGGAC 

TATCAAGTGAGCAAGACATTGAAAAACACTATG^^GAGCTTAAGAACAAGTTACATGATATGTACAATCAT 
TATgcggccgcactcgagCACCACCACCACCACOAC 



SEQ ID NO: 131 amino acid sequence comprising the 40a-RR-117 fusion construct 



M 


S 


V 


G 


V 


S 


H 


Q 


V 


K 


A 


D 


D 


R 


A 


S 


G 


E 


T 


K 


A 


S 


N 


T 


H 


D 


D 


S 


L 


P 


K 


P 


E 


T 


I 


» 

Q 


£ 


A 


K 


A 


T 


I 


D 


A 


V 


E 


K 


T 


L 


S 


Q 


Q 


K 


A 


E 


L 


T 


E 


L 


A 


•T 


A 


L 


T 


K 


T 


T 


A 


E 


I 


N 


H 


L 


K 


6 


Q 


Q 


D 


N 


E 


Q 


K 


A 


Ii 


T 


S 


A 


Q 


E 


I 


Y 


T 


N 


T 


L 


A 


S 


S 


E 


E 


T 


L 


Ii 


A 


Q 


G 


A 


E 


H 


Q 


R 


E 




T 


A 


T 


E 


T 


E 


L 


H 


N 


A 


Q 


A 


D 


Q 


H 


S 


K 


E 


T 


A 


L 


S 


E 


Q 


K 


A 


S 


I 


S 


A 


E 


T 


T 


R 


A 


Q 


D 


Li 


V 


E 


Q 


V 


K 


T 


S 


E 


Q 


N 


I 


A 


K 


L 


N 


A 


M 


I 


S 


N 


P 


D 


A 


I 


T 


K 


A 


A Q 


T 


A 


N 


D 


N 


T 


K 


A 


L 


S 


S 


E 


L 


E 


K 


A 


K 


A 


D 


L 


E 


N 


Q 


K 


A 


K 


V 


K 


K 


Q 


L 


T 


E 


E 


L 


A 


A 


Q 


K 


A 


A 


L 


A 


E 


K 


E 


A 


E 


L 


S 


R 


L 


K 


S 


S 


A 


P 


S 


T 


Q 


D 


S 


I 


V 


G 


N 


N 


T 


M 


K 


A 


P 


Q 


G 


Y 


P 


It 


E 


E 


Ii 


K 


K 


L 


E 


A 


S 


G 


Y 


I 


G 


S 


A 


S 


Y 


N 


N 


Y 


Y 


K 


E 


H 


A 


D 


Q 


I 


I 


A 


K 


A 


S 


P 


G 


N 


Q 


Ii 


N 


Q 


Y 


Q 


D 


I 


P 


A 


D 


R 


N 


R 


F 


V 


D 


P 


D 


N 


L 


T 


P 


E 


V 


Q 


N 


E 


Ii 


A 


Q 


F 


A 


A 


H 


M 


I 


N 


S 


V 


R 


R 


Q 


Ii 


G 


Ii 


P 


P 


V 


T 


V 


T 


A 


G 


S 


Q 


E 


F 


A 


R 


L 


Ii 


S 


T 


S 


Y 


K 


K 


T 


H 


G 


N 


T 


R 


P 


S 


F 


V 


Y 


G 


Q 


P 


G 


V 


S 


G 


H 


Y 


G 


V 


G 


P 


H 


D 


K 


T 


I 


I 


E 


D 


S 


A 


G 


A 


S 


G 


L 


I 


R 


N 


D 


D 


N 


M 


Y 


E 


N 


I 


G 


A 


F 


N 


D 


V 


H 


T 


V 


N 


G 


I 


K 


R 


G 


I 


Y 


D 


s 


I 


K 


Y 


M 


L 


F 


T 


D 


H 


L 


H 


G 


N 


T 


Y 


G 


H 


A 


I 


N 


F 


Ii 


R 


V 


D 


K 


H 


N 


P 


N 


A 


P 


V 


Y 


Ii 


G 


F 


S 


T 


S 


N 


V 


G 


S 


L 


N 


E 


H 


F 


V 


M 


F 


P 


E 


S 


N 


I 


A 


N 


H 


Q 


R 


F 


N 


K 


T 


P 


I 


K 


A 


V 


G 


S 


T 


K 


D 


Y 


A 


Q 


R 


V 


G 


T 


V 


S 


D 


T 


I 


A 


A 


I 


K 


G 


K 


V 


S 


S 


L 


E 


N 


R 


L 


S 


A 


I 


H 


Q 


E 


A 


D 


I 


M 


A 


A 


Q 


A 


K 


V 


S 


Q 


L 


Q 


G 


K 


Ii 


A 


s 


T 


L 


K 


Q 


S 


D 


S 


L 


N 


L 


Q 


V 


R 


Q 




N 


D 


T 


K 


G 


S 


L 


R 


T 


E 


L 


L 


A 


A 


K 


A 


K 


Q 


A 


Q 


Ii 


E 


A 


T 


R 


D 


Q 


S 


L 


A 


K 


L 


A 


S 


L 


K 


A 


A 


L 


H 


Q 


T 


E 


A 


L 


A 


E 


Q 


A 


A 


A 


R 


V 


T 


A 


Ii 


V 


A 


K 


K 


A 


H 


Ii 


Q 


Y 


L 


R 


D 


F 


K 


L 


N 


P 


N 


R 


L 


Q 


V 


I 


R 


E 


R 


I 


D 


N 


T 


K 


Q 


D 


L 


A 


K 


T 


T 


S 


S 


L 


L 


N 


A 


Q 


E 


A 


L 


A 


A 


L 


Q 


A 


K 


Q 


S 


S 


L 


E 


A 


T 


I 


A 


T 


T 


E 


H 


Q 


L 


T 


Ii 


L 


K 


T 


Ii 


A 


N 


E 


K 


E 


Y 


R 


H 


L 


D 


E 


D 


I 


A 


T 


V 


P 


D 


L 


Q 


V 


A 


P 


P 


L 


T 


G 


V 


K 


P 


Ii 


S 


Y 


S 


K 


I 


D 


T 


T 


P 


L 


V 


Q 


E 


M 


V 


K 


E 


T 


K 


Q 


Ii 


L 


E 


A 


S 


A 


R 


L 


A 


A 


E 


N 


T 


S 


Ii 


V 


A 


E 


A 


L 


V 


G 


Q 


T 


S 


E 


M 


V 


A 


S 


N 


A 


I 


V 


S 


K 


I 


T 


S 


S 


I 


T 


Q 


P 


S 


S 


K 


T 


S 


Y 


G 


S 


G 


S 


S 


T 


T 


S 


N 


L 


I 


S 


D 


V 


D 


E 


S 


T 


Q 


R 


5$ 


" J ' r tTkT 

51, 


HQ? 


M 




M 


A 


F 


N 


T 


S 


Q 


S 


V 


S 


A 


Q 


V 


Y 


S 


1ST 


E 


G 


Y 


H 


Q 


H 


Li 


T 


D 


E 


K 


S 


H 


Ii 


Q 


Y 


r 

"s 


K 


D 


N 


A 


Q 


L 


Q 


L 


R 


N 


I 


L 


D 


G 


Y 


Q 


N 


D 


L 


G 


R 


H 


Y 


s 


S 


Y 


Y 


Y 


Y 


N 


L 


R 


T 


V 


M 


G 




S 


S 


E 


Q 


D 


I 


E 


K 


H 


Y 


E 


E 


L 


K 


N 


K 


L 


H 


D 


M 


Y 


N 


H 


Y 


A 


A 


A 


L 


E 


H 


H 


H 


H 


H 


H 





SEQ ID NO: 132 polynucleotide sequence comprising fusion construct GAS 117 - 40a 

atggcctttaacacaagccagagtgtcagtgcacaagtttatagcaatgaagggtatcaccagcatttgac 
tgatgaaaaatcacacctgcaatatagtaaagacj^cgcacaacttcaattgagaaatatccttgacggct 
accaaaatgacctagggagacactactctagctattattactacaacctaagaacJgttatgggactatca 
. agtgagcaagacattgaaaaacactatgaagagc ttaagaacaagttacatgatatgtac aatcattatge 
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CGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACC^^ 
GATGCAGTTGAAAAAACTCTCAGTCAACAAAAAGCA^ 
TACTGCTGAAATCAACC^C^TAAAAGA 
TTTACACTAATACTCTTGCAAXST^^ 

ACAGCTACTGAAACAGAGCTO?CATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACA 

AAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAA 
ATATTGCTAAGCTCAATGCTATGATTAGCAATC 

AATACAAAAGCATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAA 

AAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTC 
TTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTAT 

cctcttgaagaacttaaaaa^ttagaagctagtggttatattggatcagctagttacaataattattacaa 
agagcatggagatcaaattattgccaaagctagtccaggt^ 

cagatcgtaatcgctttgttgatcccgataatttgacaccagaagtgcaaaatg|gctagcgcagtttgca 

GCTCACATGATTAATAGTGTAAGAAGACAATTAGGTCTACCACCAGTTACTGTTACAGCAGGATCACAAGA 

ATTTGCAAGATTACTTAGTACCAGCTATAAGAAAACTCATGGTAATACAAGACCATCATTTGTCTACGGAC 

AGCCAGGGGTATCAGGGCATTATGGTGTTGGGCCTCATGATAAAACTATTATTGAAGACTCTGCCGGAGCG 

TCAGGGCTCATTCGAAATGATGArfAACATGTACGAGAATATCGGTGCTTTTAACGATGTGCATACTGTGAA 

TGGTATTAAACGTGGTATTTA.TGACAGTATCAAGTATATGCTCTTTACAGATCATTTACACGGAAATACAT 

ACGGCCATGCTATTAACTTTTTACGTGTAGATAAACATAACCCTAATGCGCCTGTTTACCTTGGATTTTCA 

ACCAGCAATGTAGGATCTTTGAATGAACACTTTGTAATGTTTCCAGAGTCTAACATTGCTAACGA 

CTTTAATAAGACCCCTATAAAAGCCGTTGGAAGTAGAAAAC^TTATGCCCAAAGAGTAGGCACTGTATCTG 

ATACTATTGCAGCGATCAAAGGAAAAGTAAGCTCATTAGAAAATCGTTTGTCGGCTATTCATCAAGAAGCT 

GATATTATGGCAGCCCAAGCTAAAGTAAGTCAACTTCAAGGTAAATTAGCAAGCACACTTAAGCAGTCAGA 

CAGCTTAAATCTCCAAGTGAGACAATTAAATGATACTAAAGGTTCTTTGAGAACAGAATTACTAGCAGCTA 

AAGCAAAACAAGCACAACTCGAAGCTACTCGTGATCAATCATTAGCTAAGCTAGCATCGTTGAAAGCCGCA 

CTGCACCAGACAGAAGCCTTAGCAGAGCAAGCCGCAGCCAGAGTGACAGCACTGGTGGCTAAAAAAGCTCA 

TTTGCAATATCTAAGGGACTTTAAATTGAATCCTAACCGCCTTCAAGTGATACGTGAGCGCATTGATAATA 

CTAAGCAAGATTTGGCTAAAACTACCTCATCTTTGTTAAATGCACAAGAAGCTTTAGCAGCCTTACAAGCT 

AAACAAAGCAGTCTAGAAGCTACTATTGCTACCACAGAACACCAGTTGACTTTGCTTAAAACCTTAGCTAA 

CGAAAAGGAATATCGCCACTTAGACGAAGATATAGCTACTGTGCCTGATTTGCAAGTAGCTCCACCTCTTA 

CGGGCGTAAAACCGCTATCATATAGTAAGATAGATACTACTCCGCTTGTTCAAGAAATGGTTAAAGAAACG 

AAACAACTATTAGAAGCTTCAGCAAGATTAGCTGCTGAAAATACAAGTCTTGTAGCAGAAGCGCTTGTTGG 

CCAAACCTCTGAAATGGTAGCAAGTAATGCCATTGTGTCTAAAATCACATCTTCGATTACTCAGCCCTCAT 

CTAAGACATCTTATGGCTCAGGATCTTCTACAACGAGCAATCTCATTTCTGATGTTGATGAAAGTACTCAA 
cGtgcggccgcactcgagCACCACCACCACCACCAC 

SEQ ID NO: 133 amino acid sequence comprising fusion construct GAS 117-40a 
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SQIiQGKLASTIiKQSDSLNLQVRQIi'NDTKGSLRTELIi 
AAKAKQAQ LEATRDQSLAKIiASIiKAAIiHQTEALAEQ 
AAARVTALtVAKKAHLQYliRDFKLNPNRLQVIRERID 
NTKQDIiAKTTSSLLNAQEAIiAALQAKQSSLEATIAT 
TEHQLTLL, KTIiANEKEYRHLDEDIATVPDLQVAPPIi 
TGVKPLSY" SKIDTTPIiVQEMVKETKQLLEASARLAA 
ENTSLVAE ALVGQTSEMVASNAIVSKITSS ITQPSS 
KTSYGSGS STTSNLI SDVDESTQRA* AALEHHHHHH 

* - 

SEQ ID NO: 134 polynucleotide sequence comprising fusion construct GAS 117-40N 

ATGGCCTTTAACACAAGCCAGAGTGTCAGTGCACAAGTTTATAGCAATGAAGGGTATCACCAGCA 

TGATGAAAAATCACACCTGCAATATAGTAAAGACAACGCACAACTTCAATTGAGAAATATCCTTGACGGCT 

ACCAAAATGACCTAGGGAGACACTACTCTAGCTATTATTACTACAACCTAAGAACCGTTATGGGACTATCA 
AGTGAGGiAC^ 

^^li^l^^^^TGAGTGTAGGCGTATCTCACCAAGTCAAAGCAGATGATAGAGCCTCAGGAGAAA 

CGAAGGCGAGTAATACTCACGACGATAGTTTACCAAAACCAGAAACAATTCAAGAGGCAAAGGCAACTATT 

GATGCAGTTGAAAAAACTCTCAGTCAACAAAAAGCAGAACTGACAGAGCTTGCTACCGCTCTGACAAAAAC 

TACTGCTGAAATCAACCACTTAAAAGAGCAGCAAGATAATGAACAAAAAGCTTTAACCTCTGCACAAGAAA 

TTTACACTAATACTCTTGCAAGTAGTGAGGAGACGCTATTAGCCCAAGGAGCCGAACATCAAAGAGAGTTA 

ACAGCTACTGAAACAGAGCTTCATAATGCTCAAGCAGATCAACATTCAAAAGAGACTGCATTGTCAGAACA 

AAAAGCTAGCATTTCAGCAGAAACTACTCGAGCTCAAGATTTAGTGGAACAAGTCAAAACGTCTGAACAAA 

ATATTGCTAAGCTCAATGCTATGATTAGCAATCCTGATGCTATCACTAAAGCAGCTCAAACGGCTAATGAT 

AATACAAAAGCATTAAGCTCAGAATTGGAGAAGGCTAAAGCTGACTTAGAAAATCAAAAAGCTAAAGTTAA 

AAAGCAATTGACTGAAGAGTTGGCAGCTCAGAAAGCTGCTCTAGCAGAAAAAGAGGCAGAACTTAGTCGTC 

TTAAATCCTCAGCTCCGTCTACTCAAGATAGCATTGTGGGTAATAATACCATGAAAGCACCGCAAGGCTAT 

CCTCTTGAAGAACTT.AAAAAATTAGAAGCTAGTGGTTAT^ 

AGAGCATGCAGATCAAATTATTGCCAAAGCT^^ 

t cgagCACCACCACCACCACCAC 

SEQ ID NO: 135 
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SEQ ID NO: 136 

AGTTGGTA 
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